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PROGRAM CONTAINING BROWSER AND MULTIMEDIA CONTENT 

5 Technical Field 

The present invention relates to a program containing a 
browser and a multimedia content, and more particularly to a program 
containing a browser capable of dealing with a sophisticated 
multimedia content with ease, and a multimedia content. 

10 

Background Art 

In recent years, the dissemination of information 
networks, particularly the Internet, have been striking. W.W.W 
(World Wide Web) among others is essential for services not only 
15 for businesses but also for general home users. W.W.W is services 
allowing a reception terminal to access a variety of content (home 
pages) existing on the Internet and obtain necessary information. 

In recent years, general users also use W.W.W as a tool 
20 for not only collecting content but also for enjoying the content 
rich in entertainment or presenting their own content. It is 
expected that the number of users is increasing more and more in 
the future. 

W.W.W is a software architecture developed by the CERN 
25 (Conseil Europ'en pour la Recherche Nucl'aire) in 1989. W.W.W is 
the communication system constituted of a software group constituted 
of a so-called HTTP (HyperText Transfer Protocol) server and an HTML 
(HyperText Markup Language) browser interconnected by the Internet, 
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and is supported by software HTTP and HTML. HTTP is the 
communication protocol between an HTTP server and an HTML browser, 
and HTML is the format of (homepage) data exchanged between an HTTP 
server and an HTML browser. 
5 Fig. 1 shows a mechanism of producing a W.W.W content 

(homepage) of this type. 

As shown in Fig. 1, in a W.W.W system 1, a reception 
terminal 11 and an HTTP server 13 are interconnected by the Internet, 
the reception terminal being constituted of an HTML browser 22 and 
10 a platform 21 made of a predetermined OS (Operating System) , hardware 
and the like. 

Description will be made on a process of reproducing 
content on the HTTP server 13 by the reception terminal 11. 

Upon reception of an URL (Uniform Resource Locater) of 
15 a homepage from a user as shown by an arrow 31, the HTML browser 
22 of the reception terminal 11 accesses the HTTP server 13 designated 
by the input URL via the Internet 12 as shown by an arrow 32 to request 
for the data of the homepage designated by the URL. 

In response to the request, the HTTP server 13 transmits 
20 the data file (hereinafter called an HTML document) 33-1 of the 
homepage described in the HTML format via the Internet 12. 
The HTML browser 22 of the reception terminal 11 displays the received 
HTML text 33-1 on a display (not shown) of the reception terminal 
11, for example, in the format of a display 34. The HTML browser 
25 22 acquires related data 33-2 such as a bitmap file (image file) , 
when necessary, from the HTTP server 13, and displays it in a 
predetermined area of the display 34. 

The HTML browser 22 is therefore essential software for 



2 



general users utilizing W.W.W. HTML has been developed in order 
for researchers to share text data such as technical documents via 
information networks, and has distinctive characteristics that 
documents are structured by using an approach called Markup. 

5 

Fig. 2 shows an HTML text and its display example. 

In the HTML text 33-1, the portion sandwiched between "<" 
and ">" is called a tag which is used for Markup. 

10 The contents of the portion sandwiched between HTML tags 

(<HTML> and </HTML>) indicate HTML data. The portion sandwiched 
between BODY tags (<B0DY> and </B0DY>) indicates the main text of 
the document. A portion 41 "<H1>PLEASANT FOOD SHOP IN 
KAMAKURA</H1>" sandwiched between HI tags (<H1> and </Hl>) indicates 

15 that the character string "PLEASANT FOOD SHOP IN KAMAKURA" is a 
headline of the document. Portions 42 and 43 sandwiched between 
P tags (<P> and </P>) indicate sentences of the documents. 
Standardization of such tag types is managed by W3C (World Wide Web 
Consortium) . 

20 A display 34 provides the content in conformity with the 

structure of the HTML document 33-1. 

A character string corresponding to the portion 41 
sandwiched between HI tags is displayed in the upper area 51 of the 
display 34, a sentence corresponding to the portion 42 sandwiched 
25 between the P tags is displayed in the lower area 52, and a sentence 
corresponding to the portion 43 sandwiched between the P tags is 
displayed in an area lower than the area 52. 

Since the HTML document 33-1 is expressed by text, HTML 
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is not dependent upon a platform such as OS. Since HTML is a text 
format , a user and the like can create a content in a relatively 
straightforward manner by using HTML with some learning. 

HTML can insert an image into a document by using <IMG> 
5 tags. For example, an image 33-2 is displayed in an area 54 in the 
area 52 of the display 34, the image 33-2 corresponding to an image 
file (food.jpg) corresponding to a portion 44 sandwiched between 
<IMG> tags in the portion 42 sandwiched between the P tags in the 
HTML document 33-1. 

10 Since HTML is provided with tags for expressing media 

other than the text, HTML can deal with a combination of a plurality 
of media such as texts, images (still images and moving images) and 
audio. The content of this type is hereinafter called a multimedia 
content. Therefore, most of W.W.W data known as so-called homepages 

1 5 are a multimedia content . 

As above, since HTML is characterized in that it does not 
depend upon a platform, it can deal with multimedia information, 
and it can create content in a relatively straightforward manner. 

20 Services for distributing a multimedia content are 

prevailing and HTML derivative technologies have been adopted in 
information networks for digital electronic home appliances such 
as mobile phone networks and digital broadcasting, in addition to 
the Internet. For example, BML (Broadcast Markup Language) ,. Compact 

25 HTML and the like are known, BML being adopted for data broadcasting 
services of BS (Broadcasting Satellite) and Compact HTML being used 
by the i-mode for mobile phones by the NTT DoCoMo, Inc. These 
services are often supplied nowadays on the assumption that they 
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are used only by particular digital home electronic appliances or 
particular information networks, and mutual distribution of content 
is not possible or limitative. 

It is expected in the future that all digital electronic 
5 home appliances including computers are interconnected by one 
information network. In such a case, it is requested that any 
terminal can receive services of the same content and that the content 
is highly entertaining, has sophisticated functions, high 
performance as well as being apt to be developed under reduced cost. 

10 

As described earlier, however, HTML has been developed 
in order for researchers to share text data, and is not developed 
as a system for satisfying the needs expected to become essential 
in the future. As solutions for satisfying such needs, HTML has 

15 therefore the following issues left to be solved. 

Namely, as described above, HTML is the format for 
expressing a structured document, i.e., text media. In HTML, 
therefore, media (images and the like) other than the text is inserted 
into a text (document) . 

20 However, if other media are inserted into a document, 

there arises a first issue that consistency of presentation is not 
guaranteed. 

More specifically, although the position of media 
inserted into a text is determined by the text layout, the text is 
25 returned when it reaches the end of the window (for example, as shown 
in Fig. 2, in the area 52 of the display 34, the text is returned 
at "PASTER HOUSE" and the following "HAS BEEN FOUND" is displayed 
in the next row) . The display position of media is not determined 
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therefore until the window size of an HTML browser is determined 
(until a user displays content) . 

A display 34-1 shown in Fig. 3 and a display 34-2 shown 
in Fig. 4 are examples of presentations having different media 
5 display positions because of different window sizes, even with the 
same HTML document, i.e. , the displays are examples of presentations 
not guaranteeing presentation consistency. 

In other words, although a layout method of a display by 
HTML is a suitable method of displaying media such as images as part 

10 of a document, it is not a suitable method of displaying a combination 
of various media as a multimedia content. 

The reason for this is that considerable attention is paid 
to the design of a multimedia content in many cases , and that a content 
creator often desires that the positions of respective media to be 

15 presented can be designated by absolute values and the content of 
the same design can be presented irrespective of the window size 
of an HTML browser. Namely, the content creator does not desire 
to display differently the created multimedia content as in the 
display 34-1 (Fig. 3) or display 34-2 (Fig. 4) depending upon the 

20 window size of an HTML browser. The content creator desires that, 
as shown in a display 34-3 of Fig. 5 each of a title text, two main 
texts and an image is displayed by designating the absolute positions 
(in the example of Fig. 5, absolute values from the upper left corner) 
of the fixed coordinate system of the display 34-3. 

25 In addition, since the HTML format defines only the text 

structure, the layout details such as a font type, a space between 
characters and a space between rows, a hyperlink expression method 
(although expression is made often by using blue fonts and underlines, 
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definitions thereof do not exist) and the like are fundamentally 
dependent upon an HTML browser, when HTML data is actually displayed. 
If the same HTML document is presented by HTML browsers developed 
by different vendors, it may be displayed minutely different. This 
5 shows that presentation consistency is not guaranteed. 

Under such circumstances, CESS (Cascading Style Sheets) 
has been proposed as a mechanism of designating the style of an HTML 
document. By utilizing CESS, it is possible to unanimously 
designate the layout of a text. However, CESS is used persistently 

10 for the layout of a document, HTML browsers of companies support 
in different manners and as will be later described, CESS does not 
provide services for 3D (3 Dimensional) . Further, CESS has its own 
grammar (different from HTML) and content creators are required to 
be familiar with it. 

15 As above HTML is associated with the first issue that 

presentation consistency is not guaranteed which is the essential 
condition of multimedia expression., 

Further, HTML is associated with a second issue that no 
synchronizing mechanism between media is provided, from the view 

20 point of multimedia expression. 

The synchronizing mechanism between media is a means for 
controlling the presentation timings when a combination of media 
having a time axis, such as audio, moving images and animation, is 
presented. Use of the synchronizing mechanism between media allows 

25 expressions, for example, reproduction of animation synchronized 
with music and concurrent or continuous reproduction of a plurality 
of moving images. As above, the synchronizing mechanism between 
media is a function essential for realizing a multimedia content. 
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Further, HTML is associated with the third issue that 
there is no compatibility of the content between different types 
of apparatuses (e.g., between a personal computer and a digital 
electronic home appliance) . 
5 Namely, although HTML has no specification regarding the 

interface for manipulating a content, many HTML browsers on personal 
computers are realized by the interfaces of personal computers, e.g. , 
by a window provided by the Windows (registered trademark) OS. 

10 Since the size of a window can be changed, the size of 

an HTML browser can also be changed. Namely, the size in a browser 
horizontal direction is not definite so that the content unable to 
be displayed is developed downward and not displayed. In order to 
display it, a scroll in the vertical direction is used. A mouse 

15 is used for a content manipulation such as a jump to another HTML 
document through hyper link. A content is created on the assumption 
of a manipulation performance dependent upon the personal computer. 

In contrast, although an HTML text can be displayed on 
20 a digital electronic home appliance, the display size of an HTML 
browser is generally fixed and a scroll function is not provided 
in some cases. Since it is necessary to be accustomed to the 
manipulation of a moving cursor such as a mouse, the moving cursor 
is considered unsuitable for electronic home appliances, so that 
25 the function corresponding to a mouse is not provided in many cases. 
Therefore, when HTML is applied to digital electronic home 
appliances, the HTML format is generally modified or extended in 
some way, in order to allow the interface of a digital electronic 



home appliances to use HTML. 

This difference between a personal computer and a digital 
electronic home appliance results in a possibility that 
compatibility is not ensured if an HTML content distributed on the 
5 Internet is distributed to a network of digital electronic home 
appliances . In order to absorb this difference, conversion software 
is used in a partial field. Since an HTML content on the Internet 
is created based upon an interface of a personal computer as described 
above, even if the HTML content converted by the conversion software 
10 is produced, the content may become meaningless presentation. 



As above, HTML is associated with the third issue of no 
compatibility between contents. 

HTML has also a fourth issue of a limit in function 

15 extension. 

Namely, as described above, HTML is primarily structured 
document data and is not an application program. An HTML browser 
can therefore display only the data prepared in an HTTP server. The 
types (media) of data capable of being displayed is limited to only 
20 those defined by the HTML format. 

For example, when newly developed media are used, it is 
necessary to perform extension of the media with regard to the HTML 
format standardized by the above-described W3C. In order also for 
the media to be used, the HTML browser of each company is required 
25 to support a software module for reproducing the new media and 
correctly recognize the extended portion, i.e., tags. 

However, in practice, the HTML browser of each company 
extends tags often in an independent manner, and there may be no 
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compatibility between these browsers. Also in some cases, an HTML 
content itself is dependent upon specific extension of a particular 
HTML browser. 

There is a plug-in function as another method of extending 
the function of an HTML content. A plug-in is a software module 
of extending a function of an HTML browser. If a plug-in realizing 
an optional function is installed in an HTML browser, this function 
can be executed by HTML content. However, in order for a user to 
use the plug-in, it is necessary to perform in advance its download, 
install and upgrade. The plug-in is required to be developed for 
each of OSes and browsers, and the HTML browser not supporting the 
plug-in cannot display the HTML content using the plug-in. 

More correctly, the extension function by a plug-in 
incorporates the mode that another application runs in a partial 
region of an HTML content, and the function is not contained in the 
content itself and the HTML itself is a structured document 
unchangeably. 

There is Java (R) Script as an extension aiming at making 
an HTML document itself execute an optional logic. Java (R) Script 
is a script language, and similar to a C language and a Java (R) 
language, has a grammar considering that even a content producer 
such as a designer and a content creator can write in a relatively 
straightforward manner, while supporting the control structure such 
as conditional branch and iteration. Code of Java (R) Script is 
characterized by ease of use such as not requiring advanced 
compilation since it is executed by an interpreter. Since 
Java (R) Script places importance to use easiness, it is inferior to 
the C language and Java(R) language in terms of an execution 
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performance, a program size and program structuralization. 

From these reasons, although Java (R) Script is suitable 
for media and a plug-in provided by HTML and a relatively simple 
application such as manipulating a browser itself, it is often 
5 unpractical to realize a complicated function (e.g., decoding 
particular media, and GUI components) by Java (R) Script itself. 

As above, HTML is associated with the fourth issue of a 
limit in its extension function. 

HTML has also a fifth issue of inferior productivity of 

10 content. 

Namely, as described above, HTML has been developed in 
order to allow researchers to share structured document data. Being 
limited to the usage of this kind, it can be said that writing a 
content with a tool such as a text editor in accordance with the 
15 HTML format is a relatively easy work for researchers well grounded 
in software. 

In contrast, when designers and creators form a content 
which is rich in entertainment and complicated, it is rare that an 
HTML document is written directly by using a text editor or the like, 

20 but dedicated authoring software is generally used. This software 
presents a graphical content forming environment so that the content 
having a relatively complicated layout can be formed without 
knowledge of HTML. 

The HTML format itself is, however, not designed by 

25 considering affinity with the authoring software. Harmful 
influences of this include an overhead of a data size to be caused 
by the software mechanically forming an HTML document, difficulty 
in compatibility between authoring software and in fine adjustment 
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at the content finishing stage, low readability of formed HTML 
documents, and the like, thus leading to the situation that 
advantages of HTML become rather disadvantages. 

As described earlier, when the extension function is 
5 mounted in an HTML content, a script language such as Java (R) Script 
is often used and designers and content creators often write code 
of Java (R) Script by themselves even for a more or less complicated 
content. 

However, from the viewpoint of content productivity, the 
10 extension functions of this type are preferably developed by 
programmers having high skills of software by using a program 
language such as a C language and a Java(R) language. Also from 
the viewpoint of content productivity, it is preferable that 
designers and creators pay attention to how the extension functions 
15 are customized and combined and which kind of services are realized, 
and that the script language such as Java (R) Script is used when 
designers and content creators combine and control these extension 
functions . 

However, presently, specialization in the production of 
20 an HTML content is still not established, and in addition the HTML 

format itself is not designed by considering affinity with 

development flows of this type. 

From the viewpoint of content productivity, although the 

extension function once developed should be reused, it cannot be 
25 said that the script language such as Java (R) Script sufficiently 

supports the function regarding software modularization (easy to 

customize, capsulated mount, software module distribution and the 

like), and it is therefore difficult to reuse. 
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As above, HTML has the fifth issue of inferior content 
productivity . 

HTML has also a sixth issue of difficulty in sufficient 
manifestation of a hardware performance. 
5 Specifically, it is expected that the function and 

performance of hardware of digital electronic home appliances 
including personal computers are improved more and more in the future. 
There is a large possibility that highly sophisticated 3D graphics 
functions as realized in some home games, among others, are adopted 

10 in other products in the near future. 

In contrast, HTML is software aiming primarily at sharing 
structured documents, and is unsuitable for expressing a content 
assuming 3D. As described above, although it is possible to extend 
the function of an HTML content by using a plug-in or a plug-in such 

15 as Java (R) Script, they are associated with the above-described 
issues and HTML is not always a proper solution to manifest the 
expected hardware performance that digital electronic home 
appliances have in the near future. 

As above, HTML is a software architecture developed in 

20 order to meet the needs of sharing document data among researchers. 
From the viewpoint of sharing document data, it can be considered 
that HTML is one of the most refined solutions even at the present 
time. However, the needs of content distribution on present 
information networks are sophisticated-function-orientated and 

25 multimedia-orientated, and it is required that content producers 
of an occupation type such as designers and content creators can 
produce a content in a straightforward manner. 

In order to deal with a change in the needs of this kind, 
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although extension of various functions has been made heretofore 
to HTML, since it does not follow the design idea inherent to HTML, 
the fundamental solution is still not reached in the context of 
content compatibility, productivity and the like and HTML is 
5 associated with the above-described first to sixth issues. 

Disclosure of the Invention 

The present invention has been conceived in consideration 
of the circumstances described above, and allows to dealing with 
10 highly functional multimedia content in a straightforward manner, 

A first program according to the present invention is 
characterized by including: browser software for providing each 
component with each service in accordance with specifications to 

15 be satisfied in order for the component to receive the service; and 
a multimedia content formed by a collection of scenes each containing 
a component group formed by customizing and arbitrarily combining 
a plurality of the components operating on the browser software and 
a script for operating the component, the service including: a 

20 service for reading a predetermined one of the multimedia content 
and loading the component group and the script contained in the scene 
constituting the multimedia content; a service for managing a life 
cycle of each component constituting the component group; a service 
for displaying the component in a predetermined 3D virtual space 

25 and erasing the component already displayed in the 3D virtual space; 
a service for providing a communication service between an 
interpreter of the script and the component; a service for managing 
focus transition between a predetermined one of the components and 
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another of the components; and a service for distributing an input 
from a predetermined external key to a corresponding one of the 
components. 

The browser software may further provide the component 
5 with the service in accordance with the specifications to be 
satisfied in order for the component to receive the service, the 
service further includes: a service for synchronously reproducing 
a plurality of media; and a transition service for transition between 
first and second ones of the scenes. 

10 The multimedia content may be configured to maintain 

presentation consistency when the multimedia content is reproduced 
on the browser software. 

A first recording medium recording a program according 
to the present invention is characterized in that the program 

15 includes: browser software for providing each component with each 
service in accordance with specifications to be satisfied in order 
for the component to receive the service; and a multimedia content 
formed by a collection of scenes each containing a component group 
formed by customizing and arbitrarily combining a plurality of the 

20 components operating on the browser software and a script for 
operating the component, the service including: a service for 
reading a predetermined one of the multimedia content and loading 
the component group and the script contained in the scene 
constituting the multimedia content; a service for managing a life 

25 cycle of each component constituting the component group; a service 
for displaying the component in a predetermined 3D virtual space 
and erasing the component already displayed in the 3D virtual space; 
a service for providing a communication service between an 
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interpreter of the script and the component; a service for managing 
focus transition between a predetermined one of the components and 
another of the components; and a service for distributing an input 
from a predetermined external key to a corresponding one of the 
5 components. 

The first program and recording medium of the present 
invention provide the program which contains browser software for 
providing each component with each service in accordance with 
specifications to be satisfied in order for the component to receive 

10 the service; and a multimedia content formed by a collection of scenes 
each containing a component group formed by customizing and 
arbitrarily combining a plurality of the components operating on 
the browser software and a script for operating the component. 

A second program as a multimedia content according to the 

15 present invention is characterized by a collection of a plurality 
of scenes each containing: a component group formed by customizing 
and arbitrarily combining a plurality of components operating on 
browser software for providing the component with each service in 
accordance with specifications to be satisfied in order for the 

20 component to receive the service; and a script for operating the 
component , the service including: a service for reading a 
predetermined one of the multimedia content and loading the 
component group and the script contained in the scene constituting 
the multimedia content; a service for managing a life cycle of each 

25 component constituting the component group; a service for displaying 
the component in a predetermined 3D virtual space and erasing the 
component already displayed in the 3D virtual space; a service for 
providing a communication service between an interpreter of the 
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script and the component; a service for managing focus transition 
between a predetermined one of the components and another of the 
components; and a service for distributing an input from a 
predetermined external key to a corresponding one of the components. 

A second recording medium storing a program as a 
multimedia content according to the present invention is 
characterized in that the program includes a collection of a 
plurality of scenes each containing: a component group formed by 
customizing and arbitrarily combining a plurality of components 
operating on browser software for providing the component with the 
service in accordance with specifications to be satisfied in order 
for the component to receive the service; and a script for operating 
the component, the service including: a service for reading a 
predetermined one of the multimedia content and loading the 
component group and the script contained in the scene constituting 
the multimedia content; a service for managing a life cycle of each 
component constituting the component group; a service for displaying 
the component in a predetermined 3D virtual space and erasing the 
component already displayed in the 3D virtual space; a service for 
providing a communication service between an interpreter of the 
script and the component; a service for managing focus transition 
between a predetermined one of the components and another of the 
components; and a service for distributing an input from a 
predetermined external key to a corresponding one of the components. 

The second program and recording medium of the present 
invention provide as the program a collection of a plurality of scenes 
each containing: a component group formed by customizing and 
arbitrarily combining a plurality of components operating on browser 
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software for providing the component with the service in accordance 
with specifications to be satisfied in order for the component to 
receive the service; and a script for operating the component. 

A third program as browser software according to the 
5 present invention is characterized by providing each component with 
each service in accordance with specifications to be satisfied in 
order for said component to receive said service; said service 
including: a service for reading a predetermined one of said 
multimedia content and loading said component group and said script 

10 contained in said scene constituting said multimedia content; a 
service for managing a life cycle of each component constituting 
said component group; a service for displaying said component in 
a predetermined 3D virtual space and erasing said component already 
displayed in said 3D virtual space; a service for providing a 

15 communication service between an interpreter of said script and said 
component; a service for managing focus transition between a 
predetermined one of said components and another of said components; 
and a service for distributing an input from a predetermined external 
key to a corresponding one of said components. 

20 A third recording medium storing a program as browser 

software and a multimedia content according to the present invention 
is characterized in that each component is provided with each service 
in accordance with specifications to be satisfied in order for the 
component to receive the service; the service including: a 

25 service for reading a predetermined one of the multimedia content 
and loading the component group and the script contained in the scene 
constituting the multimedia content; a service for managing a life 
cycle of each component constituting the component group; a service 
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for displaying the component in a predetermined 3D virtual space 
and erasing the component already displayed in the 3D virtual space; 
a service for providing a communication service between an 
interpreter of the script and the component; a service for managing 
5 focus transition between a predetermined one of the components and 
another of the components; and a service for distributing an input 
from a predetermined external key to a corresponding one of the 
components . 

The third program and recording medium of the present 

10 invention provide the browser software for supplying each component 
with each service in accordance with the specifications to be 
satisfied in order for the component to receive the service. 

A fourth program according to the present invention is 
characterized by including a step of producing a multimedia content 

1 5 containing a collection of scenes each containing: a component group 
formed by customizing and arbitrarily combining a plurality of 
components operating on browser software for providing each 
component with each service in accordance with specifications to 
be satisfied in order for the component to receive the service; and 

20 a script for operating the component, the service including: a 
service for reading a predetermined one of the multimedia content 
and loading the component group and the script contained in the scene 
constituting the multimedia content; a service for managing a life 
cycle of each component constituting the component group; a service 

25 for displaying the component in a predetermined 3D virtual space 
and erasing the component already displayed in the 3D virtual space; 
a service for providing a communication service between an 
interpreter of the script and the component; a service for managing 
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focus transition between a predetermined one of the components and 
another of the components; and a service for distributing an input 
from a predetermined external key to a corresponding one of the 
components . 

5 The program may further include a step of combining the 

produced multimedia content and the browser software to create one 
program. 

The producing step may include: a display control step 
of controlling to read the component in a distribution format and 

10 displaying on GUI; an addition step of adding the component selected 
on the GUI to the multimedia content; a providing step of providing 
the GUI which customizes the component contained in the multimedia 
content; and a converting step of converting the multimedia content 
edited by the display control step, the addition step and the 

15 providing step into a distributable format. 

A fourth recording medium storing a program according to 
the present invention is characterized by including a step of 
producing a multimedia content containing a collection of scenes 
each containing: a component group formed by customizing and 

20 arbitrarily combining a plurality of components operating on browser 
software for providing each component with each service in 
accordance with specifications to be satisfied in order for the 
component to receive the service; and a script for operating the 
component, the service including: a service for reading a 

25 predetermined one of the multimedia content and loading the 
component group and the script contained in the scene constituting 
the multimedia content; a service for managing a life cycle of each 
component constituting the component group; a service for displaying 
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the component in a predetermined 3D virtual space and erasing the 
component already displayed in the 3D virtual space; a service for 
providing a communication service between an interpreter of the 
script and the component; a service for managing focus transition 
5 between a predetermined one of the components and another of the 
components; and a service for distributing an input from a 
predetermined external key to a corresponding one of the components. 

The fourth program and recording medium of the present 
invention provides the program which is a multimedia content 

10 containing a collection of scenes each containing: a component group 
formed by customizing and arbitrarily combining a plurality of 
components operating on browser software for providing each 
component with each service in accordance with the specifications 
to be satisfied in order for the component to receive the service. 

15 A fifth program according to the present invention is 

characterized by including: a conversion step of converting a format 
of a multimedia content produced in accordance with a first format 
utilizing techniques of expressing a data structure by a text into 
a second format, wherein the multimedia content whose format was 

20 converted into the second format by the conversion step contains 
a collection of scenes each containing: a component group formed 
by customizing and arbitrarily combining a plurality of components 
operating on browser software for providing each component with each 
service in accordance with specifications to be satisfied in order 

25 for the component to receive the service; and a script for operating 
the component, the service including: a service for reading a 
predetermined one of the multimedia content and loading the 
component group and the script contained in the scene constituting 
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the multimedia content; a service for managing a life cycle of each 
component constituting the component group; a service for displaying 
the component in a predetermined 3D virtual space and erasing the 
component already displayed in the 3D virtual space; a service for 
5 providing a communication service between an interpreter of the 
script and the component; a service for managing focus transition 
between a predetermined one of the components and another of the 
components; and a service for distributing an input from a 
predetermined external key to a corresponding one of the components. 

10 The program may further include a step of combining the 

multimedia content whose format was converted into the second format 
by the conversion step and the browser software to create one program. 

The techniques of being utilized by the first format may 
be techniques utilizing XML. 

15 A fifth recording medium storing a program according to 

the present invention is characterized by including: a conversion 
step of converting a format of a multimedia content produced in 
accordance with a first format utilizing techniques of expressing 
a data structure by a text, from the first format into a second format, 

20 wherein the multimedia content whose format was converted into the 
second format by the conversion step contains a collection of scenes 
each containing: a component group formed by customizing and 
arbitrarily combining a plurality of components operating on browser 
software for providing each component with each service in 

25 accordance with specifications to be satisfied in order for the 
component to receive the service; and a script for operating the 
component, the service including: a service for reading a 
predetermined one of the multimedia content and loading the 
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component group and the script contained in the scene constituting 
the multimedia content; a service for managing a life cycle of each 
component constituting the component group; a service for displaying 
the component in a predetermined 3D virtual space and erasing the 
5 component already displayed in the 3D virtual space; a service for 
providing a communication service between an interpreter of the 
script and the component; a service for managing focus transition 
between a predetermined one of the components and another of the 
components; and a service for distributing an input from a 

10 predetermined external key to a corresponding one of the components. 

In the fifth program and recording medium, a format of 
a multimedia content produced in accordance with a first format 
utilizing techniques of expressing a data structure by a text, is 
converted from the first format into a second format . The multimedia 

15 content whose format was converted into the second format contains 
a collection of scenes each containing: a component group formed 
by customizing and arbitrarily combining a plurality of components 
operating on browser software for providing each component with each 
service in accordance with specifications to be satisfied in order 

20 for the component to receive the service; and a script for operating 
the component . 

An information processing apparatus as an apparatus for 
reproducing a multimedia content according to the present invention 
is characterized by including: acquiring means for acquiring a 

25 program, the program comprising: browser software for providing each 
component with each service in accordance with specifications to 
be satisfied in order for the component to receive the service; and 
a multimedia content formed by a collection of scenes each containing 
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a component group formed by customizing and arbitrarily combining 
a plurality of the components operating on the browser software and 
a script for operating the component, the service including: a 
service for reading a predetermined one of the multimedia content 
5 and loading the component group and the script contained in the scene 
constituting the multimedia content; a service for managing a life 
cycle of each component constituting the component group; a service 
for displaying the component in a predetermined 3D virtual space 
and erasing the component already displayed in the 3D virtual space; 

10 a service for providing a communication service between an 
interpreter of the script and the component; a service for managing 
focus transition between a predetermined one of the components and 
another of the components; and a service for distributing an input 
from a predetermined external key to a corresponding one of the 

15 components; executing means for executing the browser program 
acquired by the acquiring means; and reproducing means for 
reproducing the multimedia content acquired by the acquiring means, 
by utilizing services provided by the browser software executed by 
the executing means. 

20 The information processing apparatus may further include 

storing means for storing the browser software, the 
storing means storing the browser software in advance, wherein the 
executing means executes the browser software stored in advance in 
the storing means, when the multimedia content is acquired by the 

25 acquiring means. 

An information processing method for an information 
processing apparatus for reproducing a multimedia content according 
to the present invention is characterized by including: an acquiring 
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step of acquiring a program, the program comprising: browser 
software for providing each component with each service in 
accordance with specifications to be satisfied in order for the 
component to receive the service; and a multimedia content formed 
5 by a collection of scenes each containing a component group formed 
by customizing and arbitrarily combining a plurality of the 
components operating on the browser software and a script for 
operating the component, the service including: a service for 
reading a predetermined one of the multimedia content and loading 

10 the component group and the script contained in the scene 
constituting the multimedia content; a service for managing a life 
cycle of each component constituting the component group; a service 
for displaying the component in a predetermined 3D virtual space 
and erasing the component already displayed in the 3D virtual space; 

15 a service for providing a communication service between an 
interpreter of the script and the component; a service for managing 
focus transition between a predetermined one of the components and 
another of the components; and a service for distributing an input 
. from a predetermined external key to a corresponding one of the 

20 components; an executing step of executing the browser program 
acquired by the acquiring step; and a reproducing step of reproducing 
the multimedia content acquired by the acquiring step, by utilizing 
services provided by the browser software executed by the executing 
means . 

25 In the information processing apparatus as an apparatus 

for reproducing a multimedia content and the information processing 
method according to the present invention, a program is acquired 
which includes: browser software for providing each component with 
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each service in accordance with the specifications to be satisfied 
in order for the component to receive a predetermined service; and 
a multimedia content having a predetermined configuration, the 
acquired browser program is executed, the service provided by the 
5 executed browser software is utilized, and the acquired multimedia 
content is reproduced. 

An information processing apparatus as a first apparatus 
for producing a multimedia content according to the present 
invention is characterized in that the multimedia content is a 

10 collection of scenes each containing: a component group formed by 
customizing and arbitrarily combining a plurality of components 
operating on browser software for providing each component with each 
service in accordance with specifications to be satisfied in order 
for the component to receive the service; and a script for operating 

15 the component, the service including: a service for * reading a 
predetermined one of the multimedia content and loading the 
component group and the script contained in the scene constituting 
the multimedia content; a service for managing a life cycle of each 
component constituting the component group; a service for displaying 

20 the component in a predetermined 3D virtual space and erasing the 
component already displayed in the 3D virtual space; a service for 
providing a communication service between an interpreter of the 
script and the component; a service for managing focus transition 
between a predetermined one of the components and another of the 

25 components; and a service for distributing an input from a 
predetermined external key to a corresponding one of the components. 

An information processing method for an information 
processing apparatus as the first apparatus for producing a 
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multimedia content according to the present invention is 
characterized in that the multimedia content contains a collection 
of scenes each containing: a component group formed by customizing 
and arbitrarily combining a plurality of components operating on 
5 browser software for providing each component with each service in 
accordance with specifications to be satisfied in order for the 
component to receive the service; and a script for operating the 
component, the service including: a service for reading a 
predetermined one of the multimedia content and loading the 

10 component group and the script contained in the scene constituting 
the multimedia content; a service for managing a life cycle of each 
component constituting the component group; a service for displaying 
the component in a predetermined 3D virtual space and erasing the 
component already displayed in the 3D virtual space; a service for 

15 providing a communication service between an interpreter of the 
script and the component; a service for managing focus transition 
between a predetermined one of the components and another of the 
components; and a service for distributing an input from a 
predetermined external key to a corresponding one of the components. 

20 In the information processing apparatus as the first 

apparatus for reproducing a multimedia content and the information 
processing method according to the present invention, a multimedia 
content is produced which operates on the browser software and having 
a predetermined configuration, the browser software providing each 

25 component with each service in accordance with the specifications 
to be satisfied in order for the component to receive the service. 

An inf ormation processing apparatus as a second apparatus 
for producing a multimedia content according to the present 
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invention is characterized by comprising by including: producing 
means for producing the multimedia content in accordance with a first 
format utilizing techniques of expressing a data structure by a text; 
and converting means for converting a format of a multimedia content 
5 produced by the producing means, from the first format into a second 
format, wherein the multimedia content whose format was converted 
into the second format by the converting means contains a collection 
of scenes each containing: a component group formed by customizing 
and arbitrarily combining a plurality of components operating on 

10 browser software for providing each component with each service in 
accordance with specifications to be satisfied in order for the 
component to receive the service; and a script for operating the 
component, the service including: a service for reading a 
predetermined one of the multimedia content and loading the 

15 component group and the script contained in the scene constituting 
the multimedia content; a service for managing a life cycle of each 
component constituting the component group; a service for displaying 
the component in a predetermined 3D virtual space and erasing the 
component already displayed in the 3D virtual space; a service for 

20 providing a communication service between an interpreter of the 
script and the component; a service for managing focus transition 
between a predetermined one of the components and another of the 
components; and a service for distributing an input from a 
predetermined external key to a corresponding one of the components . 

25 The information processing apparatus may further include 

combining means for combining the multimedia content whose format 
was converted into the second format by the converting means and 
the browser software to create one program. 

28 



The techniques used by the first formation may be 
techniques utilizing XML. 

An information processing method for an information 
processing method for an information processing apparatus as the 
5 second apparatus for producing a multimedia content according to 
the present invention is characterized by including: a producing 
step of producing the multimedia content in accordance with a first 
format utilizing techniques of expressing a data structure by a text; 
and a converting step of converting a format of a multimedia content 

10 produced by the producing means, from the first format into a second 
format , wherein the multimedia content whose format was converted 
into the second format by the converting step contains a collection 
of scenes each containing: a component group formed by customizing 
and arbitrarily combining a plurality of components operating on 

1 5 browser software for providing each component with each service in 
accordance with specifications to be satisfied in order for the 
component to receive the service; and a script for operating the 
component, and the service including: a service for reading a 
predetermined one of the multimedia content and loading the 

20 component group and the script contained in the scene constituting 
the multimedia content; a service for managing a life cycle of each 
component constituting the component group; a service for displaying 
the component in a predetermined 3D virtual space and erasing the 
component already displayed in the 3D virtual space; a service for 

25 providing a communication service between an interpreter of the 
script and the component; a service for managing focus transition 
between a predetermined one of the components and another of the 
components; and a service for distributing an input from a 
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predetermined external key to a corresponding one of the components . 

In the information processing apparatus as the first 
apparatus for reproducing a multimedia content and the information 
processing method according to the present invention, the multimedia 
5 content is produced in accordance with a first format utilizing 
techniques of expressing a data structure by a text, and a format 
of the multimedia content produced by the producing means is 
converted from the first format into a second format . The multimedia 
content whose format was converted into the second format contains 

10 a collection of scenes each containing: a component group formed 
by customizing and arbitrarily combining a plurality of components 
operating on browser software for providing each component with each 
service in accordance with specifications to be satisfied in order 
for the component to receive the service; and a script for operating 

1 5 the component . 

An information processing system according to the present 
invention wherein: a first information processing apparatus for 
producing a multimedia content and distributing the produced 
multimedia content; and a second information processing apparatus 

20 for reproducing the multimedia content distributed by the first 
information processing apparatus, are interconnected by a 
predetermined network, and the information processing system is 
characterized in that: the first information processing apparatus 
produces the multimedia content containing a collection of scenes 

25 each containing: a component group formed by customizing and 
arbitrarily combining a plurality of components operating on browser 
software for providing each component with each service in 
accordance with specifications to be satisfied in order for the 
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component to receive the service; and a script for operating the 
component, the service including: a service for reading a 
predetermined one of the multimedia content and loading the 
component group and the script contained in the scene constituting 
5 the multimedia content; a service for managing a life cycle of each 
component constituting the component group; a service for displaying 
the component in a predetermined 3D virtual space and erasing the 
component already displayed in the 3D virtual space; a service for 
providing a communication service between an interpreter of the 

10 script and the component; a service for managing focus transition 
between a predetermined one of the components and another of the 
components; and a service for distributing an input from a 
predetermined external key to a corresponding one of the components, 
and distributes as one program the produced multimedia content and 

15 the browser software to the second information processing apparatus 
via the network, and the second information processing apparatus 
reproducing the multimedia content in the distributed program on 
the browser software in the distributed program- 
In the information processing system according to the 

20 present invention, the first information processing apparatus 
produces the multimedia content having a predetermined 
configuration and operating on browser software for providing each 
component with each service in accordance with the specifications 
to be satisfied in order for the component to receive the service, 

25 one program constituted of the produced multimedia content and 
browser is distributed, and the second information processing 
apparatus displays the distributed multimedia content on the 
distributed browser software. 



The computer for executing the program of the present 
invention may be a stand-alone computer such as a personal computer 
or a computer built in another apparatus such as a digital electronic 
home appliance. Similarly, the information processing apparatus 
of the present invention may be a personal computer and the like 
or another apparatus such as a digital electronic home appliance. 

Brief Description of the Drawings 

Fig. 1 is a perspective view showing an example the 
configuration of a W.W.W system illustrating the operation principle 
of W.W.W. 

Fig. 2 is a diagram showing an example of an HTML document 
and its display example. 

Fig. 3 is a diagram showing a display example of the HTML 
document shown in Fig. 2 when presentation consistency is not 
guaranteed. 

Fig. 4 is a diagram showing another display example of 
the HTML document shown in Fig. 2 when presentation consistency is 
not guaranteed. 

Fig. 5 is a diagram showing another display example of 
the HTML document shown in Fig. 2 when presentation consistency is 
guaranteed. 

Fig. 6 is a diagram illustrating a distribution method 
for an HTML document. 

Fig. 7 is a diagram illustrating a distribution method 
for a Java(R) application. 

Fig. 8 is a diagram showing an example of the structure 
of upper, lower, right and left keys and a decision key. 
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Fig. 9 is a diagram showing an example of the configuration 
of an information distribution system to which the invention is 
applied. 

Fig. 10 is a block diagram showing an example of the 
5 structure of a multimedia content producing apparatus of the 
information distribution system shown in Fig. 9. 

Fig. 11 is a block diagram showing an example of the 
structure of a server of the information distribution system shown 
in Fig. 9. 

10 Fig. 12 is a block diagram showing an example of the 

structure of a reception terminal of the information distribution 
• system shown in Fig. 9. 

Fig. 13 is a diagram illustrating the mechanism of 
reproducing a multimedia content. 
15 Fig. 14 is a diagram showing software stacks on the side 

of the reception terminal. 

Fig. 15 is a diagram showing an example of the arrangement 
of distribution software shown in Fig. 13. 

Fig. 16 is a diagram showing the structure of a MComponent 
20 package which is the interface with the browser and components shown 
in Fig. 13. 

Fig. 17 is a diagram showing an example of focus transition 
of the browser shown in Fig. 13. 

Fig. 18 is a diagram illustrating the sequence of a media 
25 synchronizing mechanism of the browser shown in Fig. 13. 

Fig. 19 is a block diagram showing an example of the 
structure of the browser shown in Fig. 13. 

Fig. 20 is a status transition diagram illustrating the 
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life cycle of a component of the browser shown in Fig. 13. 

Fig. 21 is a flow chart illustrating a scene activating 
process by the browser shown in Fig. 13. 

Fig. 22 is a flow chart illustrating a scene ending process 
5 by the browser shown in Fig. 13. 

Fig. 23 is a diagram illustrating the sequence of an event 
distribution mechanism of the browser shown in Fig. 13. 

Fig. 24 is a flow chart illustrating a process by an 
application manager shown in Fig. 14. 
10 Fig. 25 is a diagram showing a display example when an 

authoring system is activated. 

Fig. 2 6 is a diagram showing an example of the reproduction 
of a multimedia content produced by the authoring system shown in 
Fig. 25. 

15 Fig. 27 is a diagram showing an example of the arrangement 

of software of the multimedia content producing apparatus shown in 
Fig. 9 mounted with conversion software. 

Fig. 28 is a flow chart illustrating a process by the 
conversion software shown in Fig. 27. 
20 Fig. 2 9 is a diagram showing an example that a multimedia 

content whose format was converted by the conversion software shown 
in Fig. 27 is distributed to a reception apparatus shown in Fig. 
9 and reproduced. 

Fig. 30 is a diagram showing an example of XML data (a 
25 multimedia content of an XML format) shown in Fig. 27. 

Fig. 31 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 32 is a diagram showing the example of the XML data 
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(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 33 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 34 is a diagram showing the example of the XML data 
5 (a multimedia content of the XML format) shown in Fig. 27. 

Fig. 35 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 36 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 
10 Fig. 37 is a diagram showing the example of the XML data 

(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 38 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 39 is a diagram showing the example of the XML data 
15 (a multimedia content of the XML format) shown in Fig. 27. 

Fig. 40 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 41 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 
20 Fig. 42 is a diagram showing the example of the XML data 

(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 43 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 44 is a diagram showing the example of the XML data 
25 (a multimedia content of the XML format) shown in Fig. 27. 

Fig. 45 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 46 is a diagram showing the example of the XML data 
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(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 47 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 48 is a diagram showing the example of the XML data 
5 (a multimedia content of the XML format) shown in Fig. 27. 

Fig. 4 9 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 50 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 
10 Fig. 51 is a diagram showing the example of the XML data 

(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 52 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 53 is a diagram showing the example of the XML data 
15 (a multimedia content of the XML format) shown in Fig. 27. 

Fig. 54 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 55 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 
20 Fig. 56 is a diagram showing the example of the XML data 

(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 57 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 58 is a diagram showing the example of the XML data 
25 (a multimedia content of the XML format) shown in Fig. 27. 

Fig. 59 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 60 is a diagram showing the example of the XML data 
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(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 61 is a diagram showing the example of the XML data 
(a multimedia content of the XML format) shown in Fig. 27. 

Fig. 62 is a diagram showing an example of a DTD shown 
5 in Fig. 27. 

Fig. 63 is a diagram showing the example of the DTD shown 
in Fig. 27. 

Best Modes for Carrying Out the Invention 
10 First, Java(R) will be described which is one example of 

basic technologies applied to preferred embodiments of the present 
invention. 

Java(R) is fundamental software for developing or 
executing general programs, and has been developed by Sun 
15 Microsystems as software suitable for software distribution on 
information networks. 

The cores of Java(R) technologies are those indicated in 
the following (1) to (4) . 

(1) Program language specifications called a Java(R) 

20 language. 

(2) A compiler for converting a program written with the 
Java(R) language into the format called intermediate code. 

(3) A Java(R) virtual machine which is software for 
executing the intermediate code on a computer. 

25 (4) A class library standardized for the Java(R) 

language. 

The virtual machine and standardized class library among 
others are collectively called a Java(R) execution environment. 
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The specifications of the intermediate code and library 
are designed independent from the platform such as an OS, so that 
only the standardized library is used as an external module and if 
the Java (R) execution environment is supported on the terminal side, 
5 a developed Java (R) program (in actual, the Java (R) program compiled 
to the intermediate code) can be executed irrespective of the type 
of the platform at the lower level. An external module other than 
the standardized library can be used if it is developed by the Java (R) 
language and if it exists on each platform. 

10 The standardized library of Java(R) is provided with a 

function of downloading a particular file from a network through 
a protocol such as HTTP by designating an URL, and other functions. 
It is therefore possible to acquire a Java(R) application from an 
information network and execute it. 

15 The characteristics of Java(R) described above provide 

an ample possibility of content distribution via a network. The 
reason for this is that if the Java(R) execution environment is 
mounted on a terminal such as a personal computer and a digital 
electronic home appliance, not distribution of data based upon a 

20 format such as HTML, but distribution of an application having its 
own logic becomes possible on a cross platform. 

Fig. 6 and Fig. 7 show a difference of a distribution 
method between an HTML document and Java(R) content. Namely, Fig. 
6 shows a distribution method for HTML content, and in contrast, 

25 Fig. 7 shows a distribution method for a Java(R) application. 

As shown in Fig. 6, for the HTML content distribution, 
a distribution content 61, specifically an HTML document 63, i.e., 
data, is distributed via a network. Terminal software 62 is 
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constituted of an HTML browser 64 for analyzing and reproducing the 
HTML document 63, and a platform 63 such as an OS. Therefore, as 
described above, the functions capable of being realized by the 
content are limited to the functions of the HTML browser. 
5 On the other hand, as shown in Fig. 7, for the Java (R) 

application distribution, a distribution content 71, specifically 
a Java(R) application 73, is distributed via a network. Terminal 
software 72 is a Java(R) execution environment 74 for executing the 
Java(R) application 73, and a platform 75. Therefore, any of the 

10 functions capable of being mounted by using the Java(R) language 
can be realized as a distribution content. 

In advance of this trend, MHP (Multimedia Home Platform) , 
which is the specifications of a software platform for digital 
electronic home appliances standardized by the European 

15 standardization organization DVB (Digital Video Broadcasting), 
incorporates actually the support of an execution environment for 
an application formed by the Java(R) language. 

Java(R) is a software environment used for general 
applications. Developing a content of Java (R) requires programming 

20 skills of the Java (R) language, and it is not realistic for designers 
and content creators to develop a content directly by using the 
Java(R) language, and this should not be the case. Furthermore, 
the content development cost becomes higher than that of HTML in 
some cases. 

25 Therefore, the above-described first to sixth issues are 

not solved merely by adopting Java(R) . Namely, it is important to 
solve these issues by utilizing the features of Java(R). 

According to preferred embodiments of the present 
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invention, the above-described issues are solved by realizing 
software capable of production, distribution and reproduction of 
a multimedia content having the following characteristics. 

Making realizable presentation and manipulation suitable 
5 for multimedia can be enumerated as the first characteristic. 
Specifically, there are characteristics (11) to (15) given in the 
following. 

(11) Guarantee of Presentation Consistency 

In the software according to a preferred embodiment of 

10 the present invention, a multimedia content is produced by disposing 
each of mono-media on a screen. In actual, each of various types 
of mono-media is realized as a kind of a component to be described 
later. The layout of a content by the software according to the 
preferred embodiment of the present invention is made by designating 

15 the position of each component constituting the content by the 
absolute coordinates on the screen. In this manner, the produced 
content is presented by the same layout and design as those at the 
time of production, irrespective of the type of a platform or the 
window size of a browser. 

20 (12) Navigation with Upper, Lower, Right, Left, and 

Decision Keys . 

In the software according to the preferred embodiment of 
the present invention, (upper, lower, right and left) arrow keys 
and a decision key are used as the interface between a user and a 

25 content . It is supposed that the platform for executing the software 
according to the preferred embodiment of the invention provides 
these keys in a hardware way. Fig. 8 shows an example of a key group 
as hardware. Namely, the interface between a user and a content 
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according to the preferred embodiment of the present invention is 
constituted of a key 91-1 as the decision key, a key 91-2 as the 
upper arrow key, a key 91-3 as the right arrow key, a key 91-4 as 
the lower arrow key, and a key 91-5 as the left arrow key. A typical 
5 input device having the key group of this type is a remote controller 
(hereinafter simply called a remo-con) . An input/output device 
having this key group 91 is not limited to a remo-con, but obviously 
personal computers, and also mobile terminals and various digital 
electronic home appliances can perform interaction with a content 
10 by the same manipulation. By using the button group 91, a user 
selects one of the components on the screen to perform interaction 
with the content. 

(13) Content Structure of Scene Unit 

A multimedia content by the software according to the 
15 preferred embodiment of the present invention is constituted of a 
plurality of scenes (screens) and distributed in the scene unit. 

(14) Media Synchronizing Mechanism 

The preferred embodiment of the present invention 
provides a function of presenting synchronously a plurality of media 
20 having a time axis such as moving images, audio and animation. 

(15) Presentation on 3D Base 

A multimedia content according to the preferred 
embodiment of the present invention is based on presentation on the 
3D base. It is therefore possible to configure a user interface 
25 easy to understand intuitively. 

A component framework allowing realizing sophisticated 
content production at a reduced cost can be enumerated as the second 
characteristic of the software according to the preferred embodiment 
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of the present invention. Specifically, there are characteristics 
(21) to (24) given in the following. 

(21) A content is configured by combining arbitrary 

components . 

5 Each scene of a multimedia content according to the 

preferred embodiment of the present invention is configured by a 
combination of arbitrary components. Mono-media components such 
as moving images and still images or GUI (Graphical User Interface) 
components such as buttons can be enumerated as examples of 

10 components. Actual content production is performed by using an 
authoring system as will be later described. A content producer 
can add a component to a scene to customize the component by using 
the authoring system, or can write the behavior of the component 
by using the script language. 

15 (22) . Content Reproduction by Browser Software 

(hereinafter simply called a browser) 

A multimedia content according to the preferred 
embodiment of the present invention is reproduced by the browser. 
The browser loads a designated scene and performs read-in and 3D 

20 display of a component group constituting the scene. Further, the 
browser provides a component with services such as life cycle 
management of components, key input navigation support and binding 
of scripts and components. 

(23) Interface Specification of Component 

25 In order to receive services of the browser, it is 

necessary that each component is mounted in the manner satisfying 
the defined interface specification, and in other cases, the 
component and browser are perfectly separated in terms of software. 
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Therefore, if the specification is satisfied, a component having 
an arbitrary function can be developed newly and freely and assembled 
with a content. 

(24) Component Control by Script Language 
5 The software according to the preferred embodiment of the 

present invention allows even general content producers considered 
having no skill in expert programming languages such as designers 
and content creators to write the control of behavior of each 
component, such as manipulating a component, and receiving an event 
10 from the component and executing a process corresponding to the event 
type. 

A content reproduction environment of a cross platform 
can be enumerated as the third characteristic of the software 
according to the preferred embodiment of the present invention. 
15 Specifically, there are characteristics (31) to (32) given in the 
following. 

(31) Distribution of Browser 

A browser can be distributed along with a content. 
Version-up of a browser and mutual existence with other 
20 architectures are therefore easy. 

(32) Cross Platform 

The content and browser to be distributed are 
applications formed by a language operable on the cross platform 
such as Java(R), and can be executed if the platform is supplied 
25 with the operation environment of the applications. 

A preferred embodiment of the present invention will be 
described using Java(R). 

Fig. 9 shows an example of the configuration of an 
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information distribution system applied with the preferred 
embodiment of the present invention. 

As shown in Fig. 9, in the information distribution system 
101, a service provider apparatus group 111 (specifically a server 
5 124 to be described later) managed by a service provider and a 
reception terminal 113 managed by a user are interconnected via a 
predetermined network 112. 

The service provider apparatus group 111 is constituted 
of a mono-media producing apparatus 121, a component developing 

10 apparatus 122, a multimedia content producing apparatus 123, and 
a server 124 . In this embodiment, although the mono-media apparatus 
121 to server 124 are discrete apparatuses, the mono-media apparatus 
121 to server 124 may be collectively formed as one apparatus or 
they may be divided into a predetermined number of apparatuses (e.g. , 

15 two apparatuses: a first apparatus having the functions of the 
mono-media producing apparatus 121 to multimedia content producing 
apparatus 123 and a second apparatus having the function of the server 
124) . Separation points of the division are arbitrary. 

Although the entity which manages the service provider 

20 apparatus group 111 is not particularly limited, the entity is the 
subject which produces a multimedia content and distributes the 
content to a user (reception terminal 113) via the network 112. It 
is conceivable that the subject may be various types of subjects 
such as individuals aiming at profit or nonprofit and parties. 

25 The mono-media producing apparatus 121 produces various 

mono-media to be mounted on a multimedia content such as moving images, 
still images, texts and 3D shapes, and supplies them to the multimedia 
content producing apparatus 123. The mono-media producing 



44 



apparatus 121 is generally operated by a designer. 

The component developing apparatus 122 develops 
components according to the preferred embodiment of the present 
invention and supplies them to the multimedia content producing 
5 apparatus 123. Since components are developed by the Java(R) 
language, the component developing apparatus 122 is generally 
operated by a programmer. If a multimedia content can be produced 
by using only already existing components, the component developing 
apparatus 122 may be omitted. 

10 The multimedia content producing apparatus 123 combines 

various components supplied from the component developing apparatus 
122 to produce a multimedia content, assembles various mono-media 
supplied from the mono-media producing apparatus 121 in the 
multimedia content, and supplies them to the server 124. 

15 Further, the multimedia content producing apparatus 123 

produces, when necessary, software constituted of a produced 
multimedia content and a browse for reproducing it, and supplies 
the program to the server 124. 

Specif ically, the multimedia content producing apparatus 

20 123 is installed with the authoring system of the type to be described 
later, and a content producer uses this system to assemble components 
into the content, customizes them, and writes the behavior of them 
by using the script language. In this case, the content producer 
is not required to have the knowledge of the Java(R) language. 

25 Therefore, the multimedia content producing apparatus 123 is 
generally operated directly by a designer or content creator even 
not having the knowledge of the Java(R) language. 

The server 124 is connected to the network 112 and 
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distributes a multimedia content supplied from the multimedia 
content producing apparatus 123 to the reception terminal 113 via 
the network 112. Further, the server 124 distributes, when 
necessary, software constituted of a multimedia content and a 
5 browser for reproducing the content. As described earlier, this 
software may be supplied from the multimedia content producing 
apparatus 123 or may be newly produced by the server 124. 

The server 124 is installed with server application 
software for executing processes of this kind. Although the type 

10 of an server application changes with the type of a network or a 
protocol, if distribution is performed on the Internet by using the 
HTTP protocol, a general HTTP server corresponds to the server 124. 

Although the network 112 is not limited particularly, it 
is an information network capable of transmitting a multimedia 

15 content, such as the Internet. 

The reception terminal 113 is operated by a user who 
utilizes a multimedia content. Although a user is supposed to be 
a general user in a home, it is not limited thereto. Further, the 
reception terminal 113 is not limited to a particular one, and other 

20 terminals capable of utilizing a multimedia content may also be used, 
such as various digital electronic home appliances in addition to 
personal computers . 

However, in order to apply the preferred embodiment of 
the present invention, the reception terminal 113 is required to 

25 have a multimedia content and browser 131 as the Java(R) application 
software distributed via the network, a Java(R) execution 
environment 132 as the software environment for executing 
distributed Java(R) application software, and a platform 133 for 



46 



supporting the Java(R) execution environment 132. 

In this example, although the Java(R) execution 
environment 132 is a JRE (Java(R) Runtime Environment) distributed 
by Sun Microsystems, it may be others of Java(R) software. 
5 Next, with reference to Fig. 10, description will be made 

on an example of the structure of the multimedia content producing 
apparatus 123. 

A CPU 141 executes various processes in accordance with 
programs stored in a ROM 142 or programs loaded from a storage unit 
10 148 into a RAM 143. 

RAM 143 stores also data necessary for CPU 141 to execute 
various processes, when necessary. 

CPU 141, ROM 142 and RAM 143 are interconnected by a bus 
144. The bus 144 is also connected to an input/output interface 
15 145. 

The input/output interface 145 is connected to an input 
unit 14 6 constituted of a keyboard and the like, an output unit 147 
constituted of a display for displaying images (moving images and 
still images), a speaker for outputting audio sounds and the like, 
20 the storage unit 148 constituted of a hard disk and the like, and 
a communication unit 14 9 for executing a communication process with 
the mono-media producing apparatus 121, component developing 
apparatus 122, server 124 and the like respectively shown in Fig. 
9. 

25 The input/output unit 145 is also connected to a drive 

150 when appropriate, on which a removable recording medium 151 is 
mounted when appropriate, such as a magnetic disk, an optical disk, 
a magneto optical disk and a semiconductor memory, and a computer 



47 



program read from the recording medium 151 is installed in the storage 
unit 14 8 when necessary. 

The removal recording medium 151 is recorded, when 
necessary, with mono-media produced by the mono-media producing 
5 apparatus 121, components developed by the component developing 
apparatus 122, a multimedia content produced by the multimedia 
content producing apparatus 123 and the like. 

Fig. 11 shows an example of the structure of the server 
124. A CPU 161 to a removable recording medium 171 have basically 

10 similar structures to those having corresponding names of CPU 141 
to removable recording medium 141 of the multimedia content 
producing apparatus 123 of Fig. 10 described above, and the 
description thereof is therefore omitted. A communication unit 169 
executes a communication process with the multimedia content 

15 producing apparatus 123 and network 112 (Fig. 9). 

Fig. 12 shows an example of the structure of the reception 
terminal 113. A CPU 181 to a removable recording medium 191 have 
basically similar structures to those having corresponding names 
of CPU 141 to removable recording medium 141 of the multimedia content 

20 producing apparatus 123 of Fig. 10 described above, and the 
description thereof is therefore omitted. A communication unit 169 
executes a communication process with the network 112 (Fig. 9) . In 
the example shown in Fig. 12, although the reception terminal 113 
has the style of a personal computer, the style is not limited thereto, 

25 but it may use various styles such as digital electronic home 
appliances as described earlier. Therefore, the input unit 186, 
output unit 187 and the like may use various types depending upon 
the type of the reception terminal 113. 
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Although not shown, each of the mono-media producing 
apparatus 121 and component developing apparatus 122 has basically 
a similar structure to that of the multimedia content producing 
apparatus 123 of Fig. 10 described above. 
5 Next, with reference to Fig. 13, description will be made 

on the mechanism of an example of multimedia content reproduction 
to which a preferred embodiment of the present invention is applied. 

In this example, not only a content but also a browser 
is distributed via a network 112 by positively incorporating the 

10 characteristics of Java(R) capable of distributing applications on 
a cross platform. 

As indicated by an arrow 241 in Fig. 13, when a URL is 
input to the reception terminal 113, the Java(R) execution 
environment (strictly speaking, an application manager to be later 

15 described) 132 issues an access request to the server 124 on the 
network 112 designated by URL to access the server 124, as indicated 
by an arrow 242. The Java(R) execution environment (strictly 
.speaking, the ^application manager to be described later) 132 
acquires distribution software (Java(R) application software) 243 

20 constituted of a multimedia content 2 52 designated by the user and 
a browser 251 for reproducing the content, activates the acquired 
browser 251 and executes content reproduction 244 for the multimedia 
content 243. For example, as the result of the content reproduction 
244, an image 461 shown in Fig. 26 to be described later is displayed 

25 on a display (the output unit 187 in Fig. 12) of the reception terminal 
113. 

As above, since also the browser 251 is distributed via 
the network 112, the user is not necessary to be conscious of updating 
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the browser 251. Further, if the Java(R) execution environment 132 
only is standardized, software distribution of a combination of 
browsers having a variety of architectures and content can be freely 
(without standardization) performed, 
5 Furthermore, like a conventional browser, the browser 251 

may be fixedly mounted on the reception terminal 113 to run it. In 
this case, since it is not necessary to distribute the browser 251 
each time the multimedia content 252 is distributed, the size of 
distribution data can be suppressed. 
10 Next, with reference to the drawing, the operation 

environment according to a preferred embodiment of the present 
invention will be described. Fig. 14 shows software stacks on the 
side of the reception terminal 113. 

As shown in Fig. 14, a Java(R) application 243-1 exists 
15 as distribution software 243, and an application manager 271, a 
Java(R) execution environment 132 and a platform 133 exist as 
terminal software 2 61. 

As described above, the Java(R) application 243-1 is 
constituted of the multimedia content 252 and browser 251. The 
20 multimedia content 252 is structured, for example, by collecting 
a plurality of scene data 191. 

As shown in Fig. 13, the application manager 271 is 
software for downloading the multimedia content 252 and browser 251 
(distribution software 243) from the network 112 in accordance with 
25 URL input from the user, and executing them. 

The above-described Java(R) execution environment 132 is 
constituted, for example, of an extended API (Application Program 
Interface) 272, a JRE (Java (R) Runtime Environment) 273 and the like. 



50 



JRE 273 is an execution environment for Java(R) 
applications distributed by Sun Microsystems, and is constituted 
mainly of a JVM (Java(R) Virtual Machine) 284 and a J2SE API (Java (R) 2 
Platform Standard Edition API) 283. JVM 284 is software (Java(R) 
5 virtual machine) for analyzing and executing a Java(R) application 
expressed by intermediate code, and J2SE API 283 is a class library 
group standardized for Java(R) applications. 

In the example, the extended API 272 is a Java(R) class 
library group added to J2SE, and representative ones are Java (R) 3D 

10 282 and an ECMA (European Computer Manufactures Association) script 
interpreter (trademark) 281. 

Java (R) 3D 282 is a class library developed by Sun 
Microsystems for realizing an application utilizing 3D in a Java(R) 
program. It can configure a 3D virtual space by directly calling 

15 a method from a program, as different from a script language 
configuring declaratively a 3D virtual space such as a VRML (Virtual 
Reality Modeling Language) . 

In this example, the ECMA script interpreter 281 is added 
because an ECMA script is used as the script language for manipulating 

20 components. The ECMA script is the Java(R) script developed by 
Netscape Communications Corporation and standardized by the 
standardization organization ECMA. The ECMA script has the control 
structure such as conditional branch and repetition like a general 
programming language so that a flexible component control is 

25 possible. 

The platform 133 is constituted of an 
OS/driver /middleware 27 3 made of an OS, drivers, middleware and the 
like and hardware (such as CPU 181 to drive 190 shown in Fig. 12) . 
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Next, an exchange format will be described. 

The entity of a component includes a Java(R) class file 
constituting the component, and an object having an instanced class 
and set with a necessary field value, i.e., customized. 

Each component constituting a scene is written in one file 
per scene (hereinafter called an instance file) by using a 
serialization mechanism provided by a Java(R) . io package. In this 
example, the extension of an instance file is "dat". When a scene 
is reproduced, the browser reconfigures the object of each component 
from the class file group and instance file. 

Fig. 15 shows an example of the structure of software 
actually distributed to the reception terminal 113 via the network 
112. 

As shown in Fig. 15, an archive file 292 of scene data 
191 constituting each scene is constituted of an instance file 301, 
a class file 302 of components, a script file 303 in which code of 
the ECMS script is written in the text format, and a file 304 for 
texts, still images or the like . 

Of the archive file 292 of the scene data 191, the instance 
.file 301, class file 302 for class components and script file 303 
are archived by a method called a Jar type provided by Java (R) . This 
archive also contains, when necessary, a file (mono-media file) 
group 304 for texts, still images or the like used when each component 
is executed. 

A file constituting the archive file group of scene data 
is hereinafter called a scene file. The name of a scene file to 
be reproduced first is "startup, jar" being distinguished from other 
scene files, "jar" is an extension of an archive file of the jar 
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type. 

The name of the instance file 301 of each scene is the 
same name as that of the scene file 292. Therefore, the instance 
file of the startup scene is "startup.dat". 

The browser 251 is also archived by using the Jar type. 
Therefore, the distribution software is constituted of the archive 
file 291 of the browser 251 and a group of scene files (Jar files) 
292 corresponding to respective scenes. 

A file called a manifest file is built in the Jar file. 
This file is written with settings of various attributes in the text 
format. Each attribute is called an attribute. In this example, 
the attribute of the startup, jar file is extended so that a URL of 
the browser and a URL of each scene other than startup can be set. 

Also in this example, when an instance of each component 
is written in a file or read from the file, although the serialization 
mechanism provided by the Java (R) . io package is used, the embodiment 
is not limited only thereto, but techniques of the text format such 
as XML may also be used. 

Next, an interface between the browser 251 and a component 
will be described. 

In this example, the interface between the browser 251 
and a component is configured on Java (R) Beans . Java (R) Beans is a 
component framework provided by Java(R). 

Each component is required to mount a component interface 
defined by an mmcomponent in accordance with the function of the 
component, while the specification of Java (R) Beans is satisfied. 

Each interface is used for allowing a component to receive 
various services from the browser, and has an inheritance relation 
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such as shown in Fig. 16. In this drawing, the lower level interface 
inherits an upper level interface. 

With reference to Fig. 16, the details of the mmcomponent 
package will be described. 
5 An MComponent interface (hereinafter, simply described 

as MComponent) 317 is an interface all components are required to 
mount it. MComponent 317 provides a getld() method. The getld() 
method returns an ID character string of the component, and the 
browser distinguishes unanimously each component from the ID value 

10 obtained by the method. MComponent 317 also provides an 
initialize () method and a destroy () method. The component is 
required to execute an initialize process by the initialize ( ) method 
and an end process by the destroy () method. 

An MPresentable interface (hereinafter, simply described 

15 as MPresentable) 316 is used for defining the operation of a component 
having some presentation. MPresentable 316 provides a run ( ) method 
and a stop() method. The run ( ) method starts presentation, and the 
stop() method terminates the presentation. Whether presentation 
is under execution or not is judged by the true-false value returned 

20 by a getActive() method. A getlnitiallyActive ( ) method returns a 
true-false value for whether or not presentation is started 
immediately when the component starts the scene. 

An MVisible interface (hereinafter, simply described as 
MVisible) 315 defines the behavior of a visible component. 

25 An M3DComponent interface (hereinaf ter, simply described 

as M3DComponent) 313 defines the behavior of a component having 3D 
presentation. M3DComponent 313 provides a getSubGraph ( ) method. 
This method returns an instance of a BranchGroup of Java (R) 3D API. 
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The corresponding object is an entity of 3D presentation of the 
component, and the browser adds the BranchGroup object obtained by 
the method to a scene graph provided by Java (R) 3D. In this manner, 
3D presentation of the component enters a visible state. Further, 
5 M3DComponent 313 provides methods such as a get/setPosition ( ) method, 
a get/setRotation ( ) method and a get/setScale ( ) method. These 
methods designate the position, posture and scale of a component 
of each scene constituting a moving image 312 so that presentation 
consistency can be guaranteed. 

10 An MBackground interface (hereinafter, simply described 

as MBackground) 314 defines the behavior of a component expressing 
the background of a 3D space. MBackground 314 provides a 
getBackgroundGraph ( ) method. This method returns a BranchGroup 
object. This object is an entity of background presentation and 

15 is added to the scene graph by the browser. 

An MInteractible interface (hereinafter, simply 
described as MInteractible) 318 defines the behavior of an 
interactive component such as a button 311. A component mounted 
with MInteractible 318 becomes an object of focus management. Of 

20 components mounted with MInteractible 318, there is one component 
in a scene at the most, which holds a focus at an arbitrary timing. 

When a user performs a key input by manipulating the key 
group ( Fig . 8 ) , the browser 251 calls a keydown ( ) method and a keyup ( ) 
method of the component holding the focus (component mounted with 

25 MInteractible 318) . The component can therefore receive the user 
key input. If the type of an input key is one of the arrow keys 
91-2 to 91-5 (Fig. 8) , the focus transits and another component newly 
acquires the focus. At this time, a blur ( ) method is called for 
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the component having held the focus, whereas a f ocus ( ) method is 
called for the component newly acquired the focus (component mounted 
with MInteractible 318) . 

The component having the focus holds ID values as to 
5 whether the focus transits to which component in response to an input 
by any one of the upper, lower, right and left arrow keys 91-2 to 
91-5 while the component holds the focus. These values can acquire 
from the values returned by a getNavupO method, a getNavdown ( ) 
method, a getNavLeft() method, and a getNavright ( ) method 

10 respectively provided by MInteractible 318. 

An example of focus transition of this type will be 
described with reference to Fig. 17. 

It is now assumed that components are disposed as shown 
in Fig. 17, the components including a component (hereinafter, 

15 simply called a button) 311-1 mounted with MInteractible 318 having, 
"button A" as its ID value, a component (hereinafter, simply called 
a button) 311-2 mounted with MInteractible 318 having "button B" 
as its ID value, a component (hereinafter, simply called a button) 
311-3 mounted with MInteractible 318 having "button C" as its ID 

20 value, a component (hereinafter, simply called a button) 311-4 
mounted with MInteractible 318 having "button D" as its ID value, 
and a component (hereinafter, simply called a button) 311-5 mounted 
with MInteractible 318 having "button E" as its ID value. It is 
also assumed that the getNavupO method of the button 311-1 returns 

25 "button B", the getNavdown () method returns "button D", the 
getNavLef t ( ) method returns "button E" and the getNavright ( ) returns 
"button C". 

For example, assuming now that the button 311-1 holds the 
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focus, when a user depresses the upper arrow key 91-2 (Fig. 8), a 
focus management unit 372 (Fig. 23) of the browser 251 to be described 
later executes the getNavupO method of the button 311-1, judges 
that the focus destination is the button 311-2, transits the focus 
5 correspondingly, and executes the blur () method for the button 311-1 
and the f ocus ( ) method for the button 311-2. Similarly, when the 
lower arrow key 91-4, left arrow key 91-5 or right arrow key 91-3 
is depressed, the focus transits to the corresponding one of the 
button 311-4, button 311-5 and button 311-3. 

10 Reverting to Fig. 16, MInteractible 318 also issues an 

event when the focus transition occurs, i.e., the f ocus ( ) method 
or blur() method is called. When this event is generated, the 
above-described ECMA script interpreter 281 (Fig. 14) analyzes and 
executes as a script the character string returned by a getOnfocus () 

15 method or a getOnblur ( ) method provided by MInteractible 318. 

An MStream interface (hereinafter described as MStream) 
319 defines the behavior of media (hereinafter called a stream) 
changing their expression state with a lapse of time, such as audio 
sounds and moving images 312. A component mounted with MStream 319 

20 becomes the object of a synchronizing process. This interface 
provides a getStreamPosition ( ) method or a setStreamPosition ( ) 
method. Acquisition and setting of a reproduction position of a 
stream become therefore possible . A getSartLatency ( ) method allows 
to acquire a wait time until reproduction of the stream becomes 

25 reproducible (a wait time for preparation) . Further, a playStream ( ) 
method or a stopStream() method allows the reproduction of a stream. 
The playStream () method can designate the time to the reproduction 
start as an argument . 
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An MStreamContainer interface (hereinafter, described as 
MStreamContainer ) 320 defines the behavior of a component which 
executes the synchronization control of a plurality of MStreams 319. 
MStreamContainer 320 holds reference to each MStream component 
5 performing synchronization. 

Fig. 18 shows an example of a sequence when 
MStreamContainer 320 executes the control of a plurality of MStream 
components . 

In the example shown in Fig. 18, it is assumed that a scene 

10 has a container mounted with MStreamContainer 320 (hereinafter, 
simply described as a container) and audio 233 and moving images 
234 and that the container 332 performs the synchronization control 
of the audio 233 and moving images 234. 

As indicated by an arrow 341, first the ECMA script 

15 interpreter 281 executes a call of the playStream() method from the 
container 332 by using an argument 0.0. The container 332 
immediately reproduces the audio 233 and moving images 234 from the 
tops of the streams. 

It is supposed in this case that times taken to prepare 

20 for the start of reproduction of the audio 233 and moving images 
234 are different. In order to solve this problem, as indicated 
by arrows 342 and 343, the container 332 calls the getStartLatency ( ) 
method relative to the audio 233 and moving images 234 to acquire 
the times tl and t2 taken for each component to become possible to 

25 reproduce the stream. 

Next, the container 332 obtains the maximum value tO among 
these times (tl, t2) , and as indicated by arrows 344 and 345, executes 
the playStream method ( ) of each component by using this value as 
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an argument. Since the time tO is equal to or longer than the time 
tl or time t2, it can be considered that the preparation for stream 
reproduction of the audio 233 and moving images 234 is completed 
before the lapse of the time tO, so that after the lapse of the time 
5 tO, these components can start the reproduction of the streams 
synchronously (audio reproduction 346 and moving image reproduction 
34 7 are executed synchronously) . 

Also in the case that there are three or more components 
on the side being controlled, synchronous reproduction of the 

10 streams becomes possible by similar processes. 

Reverting to Fig. 16, MStreamContainer 320 does not 
define the function of the synchronizing mechanism and its 
realization method. Therefore, it is possible to realize a 
synchronization control component by using logic other than the 

15 above-described logic (shown in Fig. 18). The example of the 
sequence shown in Fig. 18 is based upon generally the same thinking 
as the synchronous execution process of JMF (Java(R) Media 
Framework) (trademark) developed by Sun Microsystems. 

Each interface constituting the mmcomponent package has 

20 been described above. In the example shown in Fig. 16, the button 

311 and moving images 312 are shown as the examples of the components 
mounting these interfaces. Since the button 311 has its entity on 
the 3D virtual space, it is mounted with M3DComponent 313, and further 
in order to allow an interactive manipulation by the remo-con key 

25 91 (Fig. 8), MInteractible 318 is mounted. Further, the moving 
images 312 are mounted with M3DComponent 313 since the moving images 

312 have its entity on the 3D virtual space, and further MStream 
319 is mounted since the moving image data having the time axis is 
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reproduced. 

Next, with referent to Fig. 19, an example of software 
of the browser will be described. 

The browser 251 is constituted of a loader unit 362, a 
5 scene data -unit 363, a drawing unit 364, an ECMA script processing 
unit 365, an event processing unit 366 and a control unit 361. 

A component loader 367 in the loader unit 362 loads 
(inputs) the scene file (archive file) 292 (Fig. 15) of a scene, 
divides it into a component group 368 managed based upon the 
10 above-described ID value and script data 369 corresponding to the 
script file 303 (Fig. 15) , and supplies these to the scene data unit 
363. Specifically, the component loader 367 reconfigure the object 
file constituting the scene from a group of class files 302 contained 
in the loaded scene file 292 (Fig. 15) and from the instance file 
15 301. 

The script data 369 is supplied to a script interpreter 
371. The script interpreter 371 is an interpreter for the 
above-described ECMA script, scans the script data 369, manipulates 
the component in accordance with the contents of data when the content 
20 is executed, or executes a designated function by supplementing an 
event generated by the component . 

As the function of this type, the ECMA script processing 
unit 365 has an extension function 370, and a corresponding function 
is supplied to the script interpreter 371 when appropriate. Namely, 
25 the extension function 370 is a function group for providing 
functions such as scene transition, and can call the ECMA script. 

A 3D drawing engine 373 in the drawing unit 364 is used 
for acquiring a corresponding component from the component group 
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368 and rendering, as a reproduction screen, 3D graphics to be 
displayed on the display (output unit 187 in Fig. 12) of the reception 
terminal 113. In this example, although the 3D drawing engine 373 
uses a drawing engine of Java(R), the invention is not limited 
5 thereto. 

An event/focus management unit 372 of the event 
processing unit 366 is used for distributing an input from the input 
unit 186 to the component holding the focus. Further, as described 
above, if the input from the input unit 18 6 is an input from any 
10 one of the upper, lower, right and left keys 91-2 to 91-5 shown in 
Fig. 8, the focus transition is performed. 

The control unit 361 is used for managing the behavior 
of the main body of the browser 251 such as activation and 
termination. 

15 Next, with reference to Fig. 20, description will be made 

on the life cycle management of components by the browser 251. After 
reading a scene, the browser 251 manages the life cycle of components 
contained in the scene, the statuses of a component being shown in 
Fig. 20. 

20 Namely, the status of a component is one of the statuses: 

a serialized status (hereinafter called a status A) , a deserialized 
status (hereinafter called a status B) , an initialized and inactive 
status (hereinafter called a status C) , an active status 
(hereinafter called a status D) and a discarded status (hereinafter 
25 called a status E) . 

Next, with reference to Fig. 20 and a flow chart of Fig. 
21, a scene activation process by the browser 251 will be described. 

When a designated scene is activated, at Step SI shown 
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in Fig. 21, the browser 251 deserializes (loads) a component 
(component in the status A shown in Fig. 20) group serialized in 
the instance file 301 of the scene file 292 (Fig. 15) . 

Namely, as indicated by an arrow 391 in Fig. 20, a 
5 readObject() method is executed so that the component transits from 
the status A to the status B. 

At Step S2 in Fig. 21, the browser 251 registers therein 
the component loaded by the process at Step SI by using the ID value 
as a key, and at Step S3 executes the initialize () method of the 
10 component to initialize the component . In this manner, as indicated 
by an arrow 392 shown in Fig. 20 the component transits from the 
status B to the status C. 

At Step S4 in Fig. 21, the browser 251 judges whether the 
component initialized by the process at Step S3 is mounted with 
15 M3DComponent 313 (Fig. 16) or MBackground 314 (Fig.- 16), and if it 
is judged that it is mounted, then at Step S5 adds the component 
to the 3D virtual space to advance to Step S6. 

In contrast, if the browser 251 judges at Step S4 that 
the component is not mounted with M3DComponent 313 or MBackground 
20 314, the process at Step S5 is not executed to advance to Step S6. 

At Step S6, the browser 251 binds the component to the 
script interpreter 371 (Fig. 19) (reference to the component is 
passed to the script interpreter 371) in order that the ECMA script 
can manipulate the component and an event generated by the component 
25 is notified to the ECMA script. 

After the component is bound to the script interpreter 
371, it becomes possible that script code executes a method of the 
component and that when the component issues an event, script code 
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designated in advance in accordance with the type of the code is 
executed (the process of this kind is hereinafter called activation 
of an event handler) . 

At Step S7, the browser 251 judges whether the component 
5 is mounted with MPresentable 316 (Fig. 16) and whether the 
initialyActive ( ) method returns true (initialyActiv=true) . 

If the browser 251 judges at Step S7 that the component 
is mounted with MPresentable 316 (Fig. 16) and that the 
initialyActive ( ) method returned true (initialyActive=true) , then 
10 at Step S8 the run ( ) method is executed, f or the component to start 
presentation . 

In this manner, as indicated by an arrow 393 the component 
transits from the status C to the status D. Namely, the component 
is set active. 

15 During reproduction of the content, if the stop() method 

is called from the script interpreter 371 (Fig. 19) as indicated 
by an arrow 394, the component transits from the status D (active 
status) to the status C (inactive status) . As indicated by an arrow 
393, if the run() method is called thereafter from the script 

20 interpreter 371, the component transits again from the status C to 
the status D. 

Namely, during reproduction of the content, the component 
is switched between the statuses C and D when appropriate. 

Reverting to Fig. 21, after executing the process at Step 
25 S8, the browser 251 advances to Step S9. 

In contrast, if the browser 251 judges at Step S7 that 
the component is not mounted with MPresentable 316 (Fig. 16) or that 
it is not initialyActiv=true, then the process at Step S8 is not 



executed to advance to Step S9. 

At Step S9, the browser 251 reads the script file 303 from 
the scene file 292, analyzes and executes it. 

Next, with reference to Fig. 20 and a flow chart of Fig. 
5 22, a scene ending process by the browser 251 will be described. 

When the content is completed, at Step S21 in Fig. 22, 
the browser 251 executes the ending process by executing the 
destroy () method relative to each component. 

In this manner, as indicated by arrows 395 and 396 in Fig. 
10 20, the component transits from the status C or D to the status E. 

At Step S22, the browser 251 deletes the registration of 
the component . 

At Step S23, the browser 251 judges whether the component 
is mounted with M3DComponent 313 (Fig. 16) or MBackground 314 (Fig. 
15 16), and if it is judged that the component is mounted with 
M3DComponent 313 or MBackground 314 (if the component is displayed 
in the 3D space) , then at Step S24, deletes the components from the 
3D virtual space to advance to Step S25. 

In contrast, if the browser 251 judges at Step S23 that 
20 the component is mounted with neither M3DComponent 313 nor 
MBackground 314 (the component is not displayed in the 3D virtual 
space) , then the process at Step S24 is not executed to advance to 
Step S25. 

At Step S25, the browser 251 releases the interpreter bind 
25 of the component with the script interpreter 371. 

Next, with reference to Fig. 23, description will be made 
on an example of a process of event distribution and focus management 
by the browser 251. 
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Fig. 23 shows a message exchange sequence among the 
event/focus management unit 372 (Fig. 19), two components 
(corresponding to two buttons 311-1 and 311-2 in Fig. 17) mounted 
with MInteractible 318 (Fig. 16) , and the script interpreter 371 
5 (Fig. 19) . 

As indicated by an arrow 411 in Fig. 23, when a user 401 
depresses any key 91 (Fig. 8), the event/focus management unit 372 
notifies this effect by executing the keypress () method relative 
to the component (in this example, the button 311-1) presently having 
10 the focus, as indicated by an arrow 412. 

Next, if the input key is one of the arrow keys 91-2 to 
92-5, transition of the focus is conducted. 

It is now assumed that the user 401 depresses the upper 
arrow key 91-2 . First, as indicated by an arrow 413, the event/focus 
15 management unit 372 executes the getNavup ( ) method relative to the 
button 311-1, and acquires from its return value an ID of the 
component at a focus transition destination (in this example, the 
ID value is "button B"). 

Next, as indicated by an arrow 415, the event /focus 
20 management unit 372 executes the blur () relative to the button 311-1 
to notify a loss of the focus. 

The button 311-1 issues an event (onbluer event) because 
of the focus loss. As indicated by an arrow 416, the script 
interpreter 372 acquires this onbluer event, and as indicated by 
25 an arrow 417, analyzes and executes the character string returned 
by the getOnblur() from the button 311-1 as the script (executes 
the event handler) . 

Next, as indicated at an arrow 418, the event /focus 
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management unit 372 notifies this effect by executing the focus () 
method relative to the component button 311-2 at the focus transition 
destination. 

The button 311-2 issues an event in response to the 
5 acquisition of the focus. As indicated by an arrow 419, the script 
interpreter 371 acquires this in focus event, and as indicated by 
an arrow 420, analyzes and executes the character string returned 
by the getOnfocusO from the button 311-2 as the script (executes 
the event handler) . 

10 As indicated by an arrow 421, as the user 401 detaches 

thereafter the finger from the depressed key (executes the key input) , 
the event/focus management unit 372 notifies this effect by 
executing the keyup ( ) method relative to the component button 311-2, 
as indicated by an arrow 422. 

15 Next, description will be made on the application manager 

applied to which is a preferred embodiment of the present invention. 

As shown in Fig. 13, the application manager 271 (Fig. 
14) is software for downloading the multimedia content 252 and 
browser 251 (distribution software 243) from the network in 

20 accordance with an URL input (arrow 241) from a user, and executing 
it, the control being passed to the browser 251 side after it is 
activated. 

With reference to a flow chart in Fig. 24, description 
will be made on an example of the process by the application manager. 
25 As shown in Fig. 13, when a user supplies the reception 

terminal 113 with an URL of a certain content (startup scene) , at 
Step S41 the application manager 271 inputs URL of the start scene. 

If the network 112 is the Internet and the HTTP protocol 
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is used on the Internet, the URL is written, for example, as in the 
following. 

URL " jar :http: //w. w. w. 

sony . co. jp/ . . . /startup, jar ! /startup, dat 11 
5 This URL indicates the instance file (startup.dat) 301 

in the scene file (archive file (startup, jar) 292 (Fig.. 15)) at the 
designated URL. 

Reverting to Fig. 24, at Step S42 the application manager 
271 acquires (downloads) the startup scene file 292 at the URL input 
10 by the process at Step S41. 

At Step S43 the application manager 271 sets paths in 
accordance with a URL of each scene other than the startup scene 
designated in the startup scene file 292 in order to allow the class 
file to be loaded at each URL. 
15 As Step S44, the application manager 271 sets a path in 

accordance with a URL of the browser 251 designated in the startup 
scene file 292 in order to allow the browser to be acquired. 

At Step S45 the application manager 271 loads (acquires) 
the archive file 291 of the browser 251 and activates the browser 
20 251. 

Thereafter, the application manager 271 passes its 
process to the browser 251 which in turn reproduces the startup scene 
at Step S46. 

Next, with reference to Fig. 25, description will be made 
25 on an authoring system 441 applied to which is a preferred embodiment 
of the present invention. 

The authoring system 4 41 is fundamentally software to be 
installed in the multimedia content producing apparatus 123 as 
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described earlier (as shown in Fig. 9) . 

As shown in Fig. 25, the authoring system 441 has a 
window-based user friendly GUI. The authoring system 441 is not 
limited particularly if the above-described multimedia content can 
5 be produced. In this example, it is assumed that the authoring 
system has the following functions (41) to (45) . 

(41) A producing function of newly producing a 
multimedia content, a storing function of storing an edited 
multimedia content and an execution function of executing a 
10 multimedia content. 

Conceivable as the storing function is a function of 
storing a multimedia content in an executable state or a function 
of storing a multimedia content in an internal form specific to the 
authoring system 4 41. 
15 (42) A content presenting function of reading a 

component developed basing upon the architecture according to the 
preferred embodiment of the present invention and presenting it to 
a user. 

(4 3) A user interface environment function of providing 
20 a visual user interface environment to add an arbitrary component 
to a multimedia content and customizing the component group. 

(44) An information presenting function of visually 
presenting screen configuration information of a multimedia content 
and information of a component group contained in the multimedia 
25 content. 

(4 5) A script edit supporting function of supporting 
edition of the ECMA script. 

As shown in Fig. 25, the authoring system 441 displays 
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the following items (51) to (54) on the display. 

(51) Component List 451 

Java (R) Beans can perform distribution in the Jar archive 
form. The component list 4 51 loads components provided in this form 
and displays them in a list form. This component group is disposed 
in a particular directory of the authoring system 4 41. The component 
list 451 adds the component selected by a user from the list to the 
content under production. 

(52) Content Tree 452 

The content tree 4 52 displays the data structure of a 
multimedia content in a tree form. If the data structure of the 
multimedia content is changed (if a scene is added or deleted or 
a component is added or deleted) , the content tree 452 reflects this 
change upon the tree. When a user selects a node of the tree, the 
content tree 4 52 sets the corresponding scene or component to a select 
state . 

(53) Property Table 454 

The property table 454 displays customization 
information of selected components in a table form. This allows 
the value of an attribute selected in the table to be edited. When 
the attribute value of a component is changed by another process, 
the property table 454 reflects this change upon the table. 

(54) Layout Panel 455 

The layout panel 4 55 provides a 3D virtual space 
corresponding to the selected scene. When a component is added to 
the scene, the layout panel 455 displays it. When the data structure 
of a scene is changed (when a scene is added or deleted or a component 
is added or deleted), the layout panel 455 reflects it. In this 
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manner, a content producer can immediately confirm the state of an 
editing content. The layout panel 455 can also set the position 
and posture of a component with a mouse. 

(55) Script Editor 453 

5 The script editor 4 53 is used for inputting and editing 

code of the ECMA script of a selected scene. 

(56) Others 

The authoring system 441 is provided with other 
components such as a menu and a tool bar. 
10 The authoring system 441 may be a content producing 

environment having no graphical environment such as shown in Fig. 
25. 

As described earlier, a multimedia content produced by 
the authoring system 441 of this type is supplied to the server 124 

15 as shown in Fig. 9, and as shown in Fig. 13, distributed from the 
server 124 to the reception terminal 113 via the network 112, along 
with the above-described browser 251 as one piece of the distribution 
software 243. As described above, the process of combining the 
multimedia content and browser 251 to obtain the distribution 

20 software 243 may be executed by the multimedia content producing 
apparatus 123 (Fig. 9) mounted with the authoring system 441 or by 
the server 124. 

The reception terminal 113 displays (acquires the state 
of the content reproduction 244) the multimedia content 252 

25 contained in the distribution software 243 on the display or the 
like by using the browser 251 contained in the distribution software 
243. At this time, an image 461 such as shown in Fig. 26 is displayed 
on the browser on the display. 
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As above, the authoring system 441 can provide an 
efficient development environment to a content producer (a user of 
the multimedia content producing apparatus 123 (Fig. 9)) and can 
be used in a straightforward manner even by a content producer having 
5 no sophisticated knowledge of software. Therefore, content 
producers,, particularly of the occupation type such as designers 
and creators, can produce the content of high quality in a 
straightforward manner by using the authoring system 441. 

However, a high cost is generally imposed on developing 

10 the system of the authoring system 441 so that it is difficult for 
users (content producers) to have it at a reduced price. 

In order to prevail the technologies applied to which are 
the above-described preferred embodiments, it is supposed that a 
multimedia content aiming at evaluation or research is required to 

15 be produced. 

It is therefore necessary to supply users (content 
producers) with the environment capable of producing a multimedia 
content readily or at a reduced cost without acquiring the system 
such as the authoring system 441. 

20 According to a preferred embodiment of the present 

invention, conversion software is prepared which can be used as an 
alternative of the authoring system 441. The conversion software 
converts the format of a multimedia content written in the format 
using techniques of representing a data structure by a text 

25 (hereinafter called text data structure expression techniques) , 
into the format (format such as shown in Fig. 15 described above) 
used when the above-described server 124 (Figs. 9 and 13) distributes 
a multimedia content. 
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The type of the text data structure expression techniques 
is not limited particularly, but techniques originally developed 
by the developer of the above-described conversion software may be 
used or methods in conformity with open standards such as XML 
5 (Extensible Markup Language) may be used. 

In this case, if the conversion software developer 
develops the conversion software adopting the text data structure 
expression techniques originally developed by the developer, for 
the format of a multimedia content (multimedia content before 
10 conversion) whose format is to be converted, then the development 
cost can be suppressed. 

If the development cost is desired to be further reduced, 
this reduction becomes possible if the conversion software developer 
develops the conversion software adopting the text data structure 
15 expression techniques in conformity with the open standards, for 
the format of multimedia content whose format is to be converted. 

A user (content producer) can therefore acquire the 
conversion software of this type in a straightforward manner and 
can produce a multimedia content of the same format at that produced 
20 by the authoring system 441 by using the conversion system, even 
if the above-described authoring system 441 is not acquired. 

The conversion software of this type is therefore 
software for converting the format of a multimedia content 
(multimedia content (data) written in the XML format is called XML 
25 data) written in the text format using XML (hereinafter called an 
XML format), into the format shown in Fig. 15 described above. 

XML is a general data descriptive language using a text 
and standardized by the above-described W3C. Similar to HTML, XML 
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expresses the data structure through markup. 

The following five points can be enumerated as the 
advantages of adopting XML for the text data structure expression 
techniques. 

5 Namely, the first point is that XML has a very high 

propagation degree as compared with other general text data 
structure expression techniques. The second point is that the XML 
standards are open and easy to use. The third point is that the 
XML grammar is simple and easy to understand. The forth point is 

10 that there are a number of libraries for processing data expressed 
by XML. The fifth point is that there is a large amount of 
application software (a dedicated editor or the like) for. supporting 
to generate data using XML which can be acquired free of charge or 
at a relatively reduced price. 

15 Fig. 27 shows an example of the structure of software of 

the multimedia content producing apparatus 123 mounted with the 
conversion software applied to which is a preferred embodiment of 
the present invention. 

As shown in Fig. 27 , the multimedia content producing 

20 apparatus 123 is provided with a text editor 501, a conversion 
software 502 and a DTD (Document Type Definition) 503. 

The text editor 501 is application software used when a 
content producer 517 writes a multimedia content in the XML format. 
Namely, the text editor 501 generates XML data (multimedia content 

25 in the XML format) 511. In this case, in the example shown in Fig. 
27, source code 512 of the ECMA script of each scene is also written 
as part of the XML data 511. The source code 512 of the script may 
be output as another file different from the XML data 511. 
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The text editor 501 is not limited particularly if the 
XML data 511 can be generated. For example, as the text editor 501, 
a general XML editor may be used such as iPAD (ai pasd) (trademark) 
and Athens Editer (trademark) acquirable at a relatively reduced 
5 price. 

The conversion software 502 is software for converting 
the XML data 511, a file 513 such as text/still images and the like 
and a class file 514 of components, into a multimedia content 516 
(multimedia content 516 constituted of a plurality of scene files 

10 292 similar to those shown in Fig. 15) of the format used when the 
above-described server 124 (Figs. 9 and 13) distributes the 
multimedia content. 

The file 513 of text/still images and the like corresponds 
to the file 304 of text/still images and the like shown in Fig. 15, 

15 and in the example shown in Fig. 27, is supplied in advance from 
the mono-media producing apparatus 121 (Fig. 9) to the multimedia 
content producing apparatus 123 to be assembled with the XML data 
511. 

The class file 514 of components corresponds to the class 
20. file 302 of component shown in Fig. 15, and in the example shown 
in Fig. 27, is supplied in advance from the mono-media producing 
apparatus 122 (Fig. 9) to the multimedia content producing apparatus 
123 as the class file of components to be contained in the XML data 
511. 

25 In the example shown in Fig. 27, the conversion software 

502 also executes the process of combining the multimedia content 
516 and the archive file 291 of the above-described browser to acquire 
the distribution software and supplying it to the server 124. 
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However, as described above, the execution site for this 
process is not limited particularly, but it may be executed by 
software (not shown) different from the conversion software 502 or 
by the server 124 (Fig. 9) instead of the multimedia content producing 
5 apparatus 123. 

DTD 503 is data representative of the definitions of the 
type and occurrence rules of tags appearing in the data (in this 
example, the XML data 511) written in the XML format. The details 
of DTD 503 and the tag will be later described. 
10 Next, with reference to a flow chart shown in Fig. 28, 

the process by the conversion software 502 shown in Fig. 27 will 
be described. 

In this example, the XML data 511 is assumed to be, for 
example, the following multimedia content in the XML format. 

15 Specifically, the XML data 511 of this example is 

converted into the multimedia content 516 by Steps S61 to S71 of 
the conversion software 502 to be later described, and supplied to 
the server 124 (Fig. 9) as the distribution software 515 along with 
the archive file 291 of the browser, and the server 124 distributes 

20 the distribution software 515 to the reception terminal 113 (Fig. 
9) via the network 112 (Fig. 9) . In this case, the reception terminal 
113 reproduces the distributed multimedia content 516 by using the 
browser corresponding to the distributed archive file 291 of the 
browser to display an image 521 such as shown in Fig. 29 on the display 

25 (the output unit 187 of Fig. 12) of the reception terminal 113. 

Specifically, the XML data 511 (XML data 511 for 
displaying the image 521 of Fig. 29) of this example is written as 
shown in Figs. 30 to 61. 
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In the XML data 511 (XML data 511 constituted of portions 
511-1 to 511-32) shown in Figs. 30 to 61, a portion surrounded by 
"<" and ">" such as M <component>" and "<scene>" is called a tag 
similar to HTML. A portion surrounded by a pair of tags (e.g., 
5 "<component>" and < /component >" is called an element. 

For example, the portion 511-1 of the XML data shown in 
Fig. 30 expresses the following. 

Namely, the XML data 511 is the multimedia content having 
the name "welcome" as indicated in a portion a, and contains a scene 
10 having the name "startup" as indicated in a portion b. As indicated 
in a portion c, the scene "startup" contains a component having an 
SSpotLight class as its entity. As indicated in portions d and e, 
the value of an "id" attribute of this component is a character string 
"sSpotLightl" . The description of the other portions of the portion 
15 511-1 and the portions 511-2 to 511-32 is omitted because of 
redundancy. 

Reverting to Fig. 28, first at Step S61 the conversion 
software 502 shown in Fig. 27 verifies the XML data 511 by using 
DTD 503. 

20 Namely, it is necessary to define the type and occurrence 

rules of tags appearing in the data written with the XML format (in 
this case, the above-described XML data 511) for each of its data 
structure. Although several types of the definition methods have 
been proposed, the above-described DTD is the most widely used one. 

25 In this example, DTD 503 is stored in advance in the 

multimedia content producing apparatus 123 (the storage unit 148 
or the like shown in Fig. 10) , and the conversion software 502 checks 
whether the XML data 511 actually produced is in conformity with 
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the rules, by referring to DTD 503. This process is herein called 
verification . 

In this example, for example, DTD 503 is written like a 
portion 503-1 in Fig. 62 and a portion 503-2 in Fig. 63. 
5 On DTD 503 shown in Fig. 62 and Fig. 63, "*" indicates 

"repetition". For example, a portion A <!ELEMENT scene (script, 
component*) > shown in Fig. 62 indicates that in the element "scene", 
an element "script" appears once and then the element "component" 
appears an arbitrary number of times . 

10 "# REQUIRED" indicates "required" and "ID" indicates a 

value "ID" the attribute can have (identifier (unique value) ) , 
respectively. For example, a portion B < ! ATTLIST scene name ID 
#REQUIRED> shown in Fig. 62 indicates that the element "scene" has 
an attribute "name" and that its value (ID) is always required to 

15 be designated and is required to be unique in the XML data 511. 
Namely, it indicates that the existence of a plurality of scenes 
(element "scene") having the same name (value (ID) of the attribute 
"name") . 

A portion A <!-- scene constituting content — > shown in 
20 Fig. 62 indicates a comment for <! ELEMENT scene (script, 
component*) >, and a portion B < ! — name of scene (required) — > shown 
in Fig. 62 indicates a comment for <! ATTLIST scene name ID #REQUIRED>. 

Similarly, the other portions of DTD 503 (each tag in the 
portions 503-1 excepting the portions A and B in Fig. 62 and each 
25 tag in the portion 503-2 in Fig. 63) write the comments (corresponding 
* to the portions <! — — >) for respective portions (tags). 

Although the description of the other portions is omitted because 
of redundancy, the meaning of each tag can be presumed easily from 
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the continent corresponding to each portion (tag) . 

After the XML data 511 is verified by using DTD 503 in 
the above manner in the process at Step S61 shown in Fig. 28, a process 
at Step S62 is executed. 
5 Namely, since the XML data (multimedia content of the XML 

format) 511 is constituted of a plurality of scenes, at Step S62 
the conversion software 502 selects a scene to be processed 
(hereinafter called an object scene) from the plurality of scenes. 

At Step S63 the conversion software 502 acquires the class 
10 type of each component contained in the object scene from the XML 
data 511. 

In accordance with the class type acquired by the process 
at Step S63, the conversion software loads (acquires) the class file 
514 of components corresponding to each class at Step S64. 
15 — * At Step S65 the conversion software 502 generates a 

Java(R) object corresponding to each component contained in the 
object scene. 

At Step S66 the conversion software 502 sets the value 
of the attribute designated by the XML data 511 to the Java (R) object 
20 generated by the process at Step S65. 

At Step S67 the conversion software 502 serializes each 
Java (R) object generated by the process at Step S65 and set with the 
value of each attribute by the process at Step S66, to the instance 
file (corresponding to the instance file 301 shown in Fig. 15 
25 described above) . 

At Step S68 the conversion software 502 extracts source 
code 512 of the ECMA script of the object scene from the XML data 
511, and outputs it as the script file (corresponding to the script 
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file 303 shown in Fig. 15 described above) . 

At Step S69 the conversion software 502 archives the 
instance file serialized by the process at Step S67 , the script file 
output by the process at Step S68, the class file group 514 of 
5 respective components contained in the object scene, and the file 
group 513 of text/still images and the like to be assembled with 
the object scene, as a scene file 292 corresponding to the object 
scene . 

Namely, the conversion software 502 generates the scene 
10 file 292 corresponding to the object scene such as shown in Fig. 
15. 

At Step S70 the conversion software 502 judged whether 
all the scenes were selected as the object scene. 

If it is judged at Step S70 that all the scenes were not 
15 selected as the object scene, the conversion software 502 returns 
its process to Step S62 whereat the next scene is selected from the 
scenes still not selected, to repeat the following processes. 

Namely, the conversion software 502 executes the 
above-described processes from Step S63 to Step S69 for each of all 
20 the scenes, to generate the scene files 292 corresponding to all 
the scenes. 

After the conversion software 502 generates the scene 
file 292 corresponding to the last scene, it judges at Step S70 that 
all the scenes were selected as the object file, and combines the 
25 multimedia content 516 constituted of the scene files 292 
corresponding to all the files and the archive file 291 of the browser 
to generate the distribution software 515 and output it to the server 
124. 
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When the server 124 distributes thereafter the 
distribution software 515 to the reception terminal 113 (Fig. 9) 
via the network 112 (Fig. 9), as described above the reception 
terminal 113 reproduces the distributed multimedia content 516 by 
5 using the browser corresponding to the archive file 291 of the 
distributed browser so that the image 521 such as shown in Fig. 29 
is displayed on the display (output unit 187 shown in Fig. 12) of 
the reception terminal 113. 

As above, in place of the above-described authoring 

10 system 4 41, the content producer 517 can acquire the above-described 
text editor 501 and conversion software 502 free of charge or at 
a reduced price, and can use the text editor 501 in a straightforward 
manner and conversion software 502 acquired in such a manner. It 
is therefore possible to produce the distribution software 515 

15 containing the multimedia content 516 in the same format (format 
shown in Fig. 15) as that generated by the authoring system 441, 
without using the authoring system 441. 

However, the conversion software 502 has a lower 
productivity of a multimedia content than that of the authoring 

20 system 4 41. In other words, although the authoring system 441 has 
specific functions of the following first to fourth functions as 
described above, the conversion software 502 does not have these 
functions. 

Namely, the first function is a function of providing a 
25 WYSIWYG (What You See Is What You Get) environment. 

A user interface environment capable of producing a 
content by visually confirming and editing final presentation, is 
herein called the WYSIWYG environment. If any change is made in 
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a content, the WYSIWYG environment allows its effect to be 
immediately confirmed so that the effect of considerably improving 
the content production can be obtained. 

The second function is a function of providing a display 
5 of an intelligible form such as a list and a tree of various 
information necessary for content production such as the 
configuration of a content and customizing information of an editing 
object component. 

The third function is a function of supporting 
10 customization of components. Namely, the third function is a 
function of supporting an input of the attribute value of a component 
by using GUI components such as a text field, a check box and a slider, 
and checking whether a user input value is in a correct range. 

The fourth function is a function of providing processes 
15 such as copy, paste, operation cancel and retry. 

It is necessary that a user writes a content correctly 
in conformity with the predetermined text format by using the text 
editor 501. The text editor has the characteristics that if there 
is a statement miss, an error occurs during conversion. It is 
20 therefore hard to say that the text editor is software having a high 
productivity of a multimedia content. 

It is therefore considered that the production 
environment using the text editor 501 and conversion software 502 
is complementary with the content production environment using the 
25 authoring system 441. 

The text format using XML or the like may be adopted as 
the format of the distribution software itself (e.g., the 
distribution software 242 shown in Fig. 13 and the distribution 



software 515 shown in Fig, 27) to be distributed by the 
above-described server 124. In this case, there occurs the new 
issues that the data size becomes large and it takes time to load 
data. Therefore, as described above, the distribution software 
5 distributed by the server 124 is preferably distributed as a binary 
format file. 

As described above, in the preferred embodiment of the 
present invention, since the component-based architecture is 
adopted, a content-specific function can be built in a multimedia 

10 content more easily than a conventional method (it can be said that 
an extension performance is high) 

Also in the preferred embodiment of the present invention, 
since 3D-based presentation is realized, it is possible to realize 
an intelligible and attractive multimedia viewing environment. 

15 Also in the preferred embodiment of the present invention, 

an arbitrary component is assembled with a content on the authoring 
tool, each component is customized in a graphical environment, and 
an environment is provided which can easily control a component by 
the script language. It is therefore possible for designers and 

20 content creators to produce highly functional content easier than 
a conventional method (it can be said that productivity is high) . 

Further in the preferred embodiment of the present 
invention, the conversion software is provided which converts the 
format of a content written as text data into the same format as 

25 that produced by the above-described authoring system. The content 
producer can therefore produce and edit a content by using a general 
text editor. The text editor of this kind is often bundled with 
an OS or is often freeware or shareware, and can be acquired free 
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of charge or at a reduced price. 

Further, since this conversion software can be developed 
much more inexpensively than the above-described authoring system, 
a load of cost upon a content producer (user) can be lessened. Still 
further, although a specific grammar may be used for writing a content 
by text, if a system in conformity with open standards such as XML 
is adopted, the development cost of the conversion software can be 
further reduced and the tool easier to be used by a content producer 
(user) can be realized. 

Furthermore, in the preferred embodiment of the present 
invention, since the user interface ensuring presentation 
consistency and using an arrow key or decision key is adopted, 
operations easier and more common than a conventional operation can 
be provided for various digital electronic home appliances including 
personal computers, and the same content can be distributed to 
various digital electronic home appliances including personal 
computers . 

As above, the preferred embodiment of the present 
invention permits solving the above-described first to sixth issues. 

Programs constituting various software of the preferred 
embodiment of the present invention are installed from a network 
or a recording medium. 

As shown in Fig. 10, 11 or 12, the recording medium is 
constituted of a removable recording medium (including a magnetic 
disk (including a floppy disk) , an optical disk (including a CD-ROM 
(Compact Disk-Read Only Memory) , and a DVD (Digital Versatile Disk) ) , 
package media such as a semiconductor memories 151, 171 or 191, 
respectively storing the programs and distributed to users 
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separately from the apparatus main body, or ROM 142, 162 or 182 or 
a hard disk in the storage unit 148, 168 or 188, respectively storing 
the programs and supplied to users in the state that ROM or hard 
disk is loaded in advance in the apparatus main body. 

In this specification, Steps writing programs to be 
stored in a recording medium include obviously the processes to be 
executed time sequentially in the order of written Steps, or the 
processes to be executed in parallel or separately which are not 
necessarily required to be executed time sequentially. 

Further, in this specification, system (excepting the 
authoring system 441) means a whole apparatus constituted by a 
plurality of apparatuses and processing units. 

Industrial Applicability 

As described above, according to the present invention, 
it is possible to handle multimedia content. Further, according 
to the present invention, highly functional multimedia content can 
be handled in a straightforward manner. 
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